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Abstract 

The cognitive interference channel is an interference channel in which one transmitter is non-causally provided with the 
message of the other transmitter. This channel model has been extensively studied in the past years and capacity results for certain 
classes of channels have been proved. In this paper we present new inner and outer bounds for the capacity region of the cognitive 
interference channel as well as new capacity results. Previously proposed outer bounds are expressed in terms of auxiliary random 
variables for which no cardinality constraint is known. Consequently it is not possible to evaluate such outer bounds explicitly 
for a given channel model. The outer bound we derive is based on an idea originally devised by Sato for the broadcast channel 
and does not contain auxiliary random variables, allowing it to be more easily evaluated. The inner bound we derive is the largest 
known to date and is explicitly shown to include all previously proposed achievable rate regions. This comparison highlights 
which features of the transmission scheme - which includes rate-splitting, superposition coding, a broadcast channel-like binning 
scheme, and Gel'fand Pinsker coding - are most effective in approaching capacity. We next present new capacity results for a class 
of discrete memoryless channels that we term the "better cognitive decoding regime" which includes all previously known regimes 
in which capacity results have been derived as special cases. Finally, we determine the capacity region of the semi-deterministic 
cognitive interference channel, in which the signal at the cognitive receiver is a deterministic function of the channel inputs. 



I. Introduction 

The rapid advancement of wireless technology in the past years has started what some commentators call the "wireless 
revolution" [JJ. This revolution envisions a world where one can access telecommunication services on a global scale without 
the deployment of local infrastructure. By increasing the adaptability, communication and cooperation capabilities of wireless 
QQ devices, it may be possible to realize this revolution. Presently, the frequency spectrum is allocated to different entities by 
fS| dividing it into licensed lots. Licensed users have exclusive access to their licensed frequency lot or band and cannot interfere 
CO with the users in neighboring lots. The constant increase of wireless services has led to a situation where new services have a 
difficult time obtaining spectrum licenses, and thus cannot be accommodated without discontinuing, or revoking, the licenses 
CO of others. This situation has been termed "spectrum gridlock" ( |2|) and is viewed as one of the factors in preventing the 
I emergence of new services and technologies by entities not already owning significant spectrum licenses. 
• In recent years, several strategies for overcoming this spectrum gridlock have been proposed fT\. In particular, collaboration 
. . , among devices and adaptive transmission strategies are envisioned to overcome this spectrum gridlock. That is, smart and well 
interconnected devices may cooperate to share frequency, time and resources to communicate more efficiently and effectively. 
The role of information theory in this scenario is to determine ultimate performance limits of a collaborating network. Given 
5J] the complexity of this task in its fullest generality, researchers have focussed on simpler models with idealized assumptions. 
One of the most well studied and simplest collaborative models is the genie aided cognitive interference channel. This 
channel is similar to the classical interference channel: two senders wish to send information to two receivers. Each transmitter 
has one intended receiver forming two transmitter-receiver (Tx-Rx) pairs termed the primary and secondary (or cognitive) 
pairs/users. Over the channel each transmitted message interferes with the other, creating undesired interference at the intended 
receiver This channel model differs from the classical interference channel in the assumptions made about the ability of 
the transmitters to collaborate: collaboration among transmitters is modeled by the idealized assumption that the secondary 
(cognitive) transmitter has full a-priori (or non-causal) knowledge of the primary message. This assumption is referred to 
as genie aided cognitiorQ. The model was firstly posed from an information theoretic perspective in |3], where the channel 
was formally defined and the first achievable rate region was obtained, demonstrating that a cognitive interference channel, 
employing a form of asymmetric transmitter cooperation, could achieve larger rate regions than the classical interference 
channel. The first outer bound for this channel was derived in |4|, together with the first capacity result for a class of channels 
termed "very weak interference" in which (in Gaussian noise) treating interference at the primary user as noise is optimal. 
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The same achievable rate region was simultaneously derived in ||5], where the authors further characterized the maximum 
rate achievable by the cognitive user without degrading the rate achievable by the primary user. A second capacity result was 
proved in |6| for the so-called "very strong interference case", where, without loss of optimality, both receivers can decode 
both messages. The capacity is also known for the case where the cognitive user decodes both messages Q with and without 
confidentiality constraints. 

However, the capacity region of the genie aided cognitive radio channel, both for discrete memoryless as well as Gaussian 
noise channels, remains unknown in general. Tools such as rate-splitting, binning, cooperation and superposition coding have 
been used to derive different achievable rate regions. The authors of |^ proposed an achievable region that encompasses all 
the previously proposed inner bounds and derived a new outer bound using an argument originally devised for the broadcast 
channel in ||9l. A further improvement of the inner bound in [8| is provided in ifTOl where the authors include a new feature in 
the transmission scheme allowing the cognitive transmitter to broadcast part of the message of the primary pair This broadcast 
strategy is also encountered in the scheme derived in 1 11 1 for the more general broadcast channel with cognitive relays, which 
contains the cognitive interference channel as special case. 

Many extensions to the cognitive interference channel have been considered. In particular, several papers have addressed the 
cognitive interference channel's idealized cognition assumption of non-causal, or a-priori message knowledge at one transmitter. 
A more realistic model of cognition is obtained by assuming a finite (rather than infinite) capacity link(s) between the encoders 
- termed the interference channel with conferencing encoders. Under the "strong interference condition", this channel model 
reduces to the compound multiple access channel whose capacity was determined in |12|. Another non-idealized model for 
cognition is the "causal cognition" model in which the cognitive encoder has access to a channel output and causally learns 
the primary message. This models is a special case of the interference channel with generalized feedback of |13|, which 
has considered in |14| where an achievable scheme using block Markov encoding was derived . In |15|, the impact of the 
knowledge of different codebooks is investigated. 

Another natural extension of the cognitive interference channel model is the so called "broadcast channel with cognitive 
relays" or "interference" channel with one cognitive relay". In this channel model, a cognitive relay in inserted in a classical 
interference channel. The cognitive relay has knowledge of the two messages and thus cooperates with the two encoders in the 
transmission of these two messages. The model contains both the interference channel and the cognitive interference channel 
when removing one of the transmitters and message knowledge (for the interference channel) and thus can reveal the optimal 
cooperation trade off between entities in a larger network. This model was first introduced in lfT6l . where an achievable rate 
region was derived. In [[7] the authors introduced a larger achievable rate region and derived an outer bound for the sum 
capacity. In 1 11 1 a yet larger inner bound is derived by having the cognitive transmitter send a private message to both receivers 
as in a broadcast channel. 

A. Main contributions 

In this paper we establish a series of new results for the discrete memoryless cognitive interference channel. Section 
introduces the basic definitions and notation. Section |III] summarizes the known results including general inner bounds, outer 
bounds and capacity in the "very weak interference" and "very strong interference" regimes. Our contributions start in Section 
HV] and may be summarized as follows: 

> A new outer bound for the capacity region is presented in Section |IVl this outer bound is looser than previously derived 
outer bounds but it does not include auxiliary random variables and thus it can be more easily evaluated. 

> In Section [V] we present a new inner bound that encompasses all known achievable rate regions. 

• We show that the newly derived region encompasses all previously presented regions in Section |Vl] 

> We derive the capacity region of the cognitive interference channel in the "better cognitive decoding" regime in Section 
IVIII this regime includes the "very weak interference" and the "very strong interference" regimes and is thus the largest 
set of channels for which capacity is known. 

• Section IVIIII focuses on the semi-deterministic cognitive interference channel in which the output at the cognitive receiver 
is a deterministic function of the channel inputs. We determine capacity for this channel model by showing the achievabihty 
of the outer bound first derived in ID. 

• In Section |IX] we consider the deterministic cognitive interference channel: in this case both channel outputs are deter- 
ministic functions of the inputs. This channel is a subcase of the semi-deterministic case for which capacity is known. 
For this channel model we show the achievabihty of the outer bound proposed in section IIVI thus showing that this outer 
bound is tight for this class of channels. 

• The paper concludes with some examples in Section |X] which provide insight on the role of cognition. We consider two 
deterministic cognitive interference channel and show the achievabihty of the outer bound of Section |IV] with transmission 
strategies over one channel use. The achievable scheme we propose provides interesting insights on the capacity achieving 
scheme in this channel model - the extra non-causal message knowledge at one of the transmitters allows a partial joint 
design of the codebooks and transmission strategies - and is easily appreciated in these simple deterministic models. 
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II. Channel model, notation and definitions 

A two user InterFerence Channel (IFC) is a multi-terminal network with two senders and two receivers. Each transmitter i 
wishes to communicate a message Wi to receiver i, i = {l,2}. In the classical IFC the two transmitters operate independently 
and have no knowledge of each others' messages. Here we consider a variation of this set up assuming that transmitter 1 (also 
called cognitive transmitter), in addition to its own message Wi, also knows the message W2 of transmitter 2 (also called 
primary transmitter). We refer to transmitter/receiver 1 as the cognitive pair and to transmitter/receiver 2 as the primary pair 
This model, shown in Figure [T] is termed the Cognitive InterFerence Channel (CIFC) and is an idealized model for unilateral 
transmitter cooperation. The Discrete Memoryless CIFC (DM-CIFC) is a CIFC with finite cardinality input and output alphabets 
and a memoryless channel described by the transition probabilities Pyi,Y2\Xi.X2{^1:^2)- 



Tx 1 



Rx 1 




Tx 2 



W2 Xi'{W2) 



Rx 2 



W2 



Fig. 1. The CIFC model. 

Transmitter i = {1, 2} wishes to communicate a message Wi, uniformly distributed on [1, . . . , 2^^'], to receiver i in N 
channel uses at rate Ri. The two messages are independent. A rate pair (i?i,i?2) is said to be achievable if there exists a 
sequence of encoding functions 

X^ = X^iWuW2) 

and a sequence of decoding functions 

W, = W,{Yi'), f = {l,2} 



such that 



lim max Pr 

AT-foo i={l,2} 



-4 0. 



The capacity region is defined as the closure of the region of all achievable (i?i, R2) pairs ifTSl . 

III. Existing results for the DM-CIFC 

We now present the existing outer bounds and the capacity results available for the DM-CIFC. The first outer bound for the 
CIFC was obtained in |4 Thm 3.2] by the introduction of an auxiliary Random Variable (RV). 

Theorem III.l. One auxiliary RV outer bound of ^ Thm 3.2]: If R2) lies in the capacity region of the DM-CIFC then 

Ri < I{Xi;Yi\X2) (la) 
R2 < I{X2,U-Y2) (lb) 

i?l+i?2 < I{X2,U-Y2)+I{Xr,Yi\X2,U), (Ic) 

taken over the union of distributions that factor as 

PU,Xi.X2PYi,Y2\XuX2- 
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Another general outer bound for the capacity region of the CIFC is provided in 18] Thm 4]. This outer bound is derived 
using an argument originally devised in |9| for the Broadcast Channel (BC). The expression of the outer bound is identical to 
the outer bound in [9] but the factorization of the auxiliary RVs differs. 

Theorem III.2. BC inspired outer bound of IX Thm. 4 ]: If (i?i,i?2) lies in the capacity region of the DM-CIFC then 

Ri < I{V,Ui-Yi) (2a) 

i?2 < I{V,U2;Y2) (2b) 

R1+R2 < I{V,Ui;Yi)+I{U2;Y2\Ui,V) (2c) 

R1+R2 < I{V,U2;Y2)+I{Ui;Yi\U2,V), (2d) 

taken over the union of distributions that factor as 

PUiPU2PV\Ui, U2PX2 \U2.yPXi\Ui.,U2, VPYi .Y2\Xi,X2- 

It is not possible to show in general the containment of the outer bound of Theorem IIII. ll "one auxiliary RV outer bound", 
into the region of Theorem IIII. 21 "BC inspired outer bound". 

The expression of the outer bound of Theorem IIII.ll "one auxiliary RV outer bound", can be simphfied in two instances 
called weak and strong interference. 

Corollary III.3. Weak interference outer bound of ^ Thm 3.4]: 
When the condition 

IiU;Y2\X2)<IiU;Yi\X2) \/pu.x„X2, (3) 
is satisfied, the outer bound of Theorem UII.ll . "one auxiliary RV outer bound", can be equivalently expressed as 

Ri<I{Yi;Xi\U,X2) (4a) 
R2<nU,X2;Y2), (4b) 

taken over the union of all distributions pu.Xi,X2- 
We refer to the condition in (O as the "weak interference condition". 

Corollary III.4. Strong interference outer bound o/^ Thm 5]: 

When the condition 

IiXi;Yi\X2) <I{Xi;Y2\X2) Vp^i.x,, (5) 

is satisfied, the outer bound of Theorem UII.ll . "one auxiliary RV outer bound", can be equivalently expressed as 

Ri<HYi;Xi\X2) (6a) 
Ri + R2<I{Y2;Xi,X2) (6b) 

taken over the union of all distributions pxi.X2- 
We refer to the condition in (|5]) as the "strong interference condition". 

The outer bound of Theorem IIII. 1 1 ."one auxiliary RV outer bound", may be shown to be achievable in a subset of the "weak 
interference" (O and of the "strong interference" ^ conditions. We refer to these subsets as the "very strong interference" 
and "very weak interference" regimes. 

Theorem III.5. Very weak interference capacity of [4. Thm. 3.4] and [5. Thm. 4.1]. 

The outer bound of Corollary 17/7. 3] "weak interference outer bound", is the capacity region if 

I{U-Y2\X2)< I(U-Y^\X2) 

I{X2]Y2)< I{X2;Yi), ypu,XuX2- (7) 



We refer to the condition in (|7]i as "very weak interference". In this regime capacity is achieved by having encoder 2 transmit 
as in a point-to-point channel and encoder 1 perform Gelf 'and-Pinsker binning against the interference created by transmitter 2. 
In a similar spirit, capacity may be obtained in "very strong interference". 
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Theorem III.6. Very strong interference capacity of [E, Thin. 5]. The outer bound of Corollary \III.4\ "strong interference 
outer bound", is the capacity region if 

/(Xi;Yi|X2) < I{Xi;Y2\X2) 

/(Fa; ^1,^2) < /(ri;Xi,X2), ^px,,x,. (8) 



We refer to the condition in ([8]l as "very strong interference". In this regime, capacity is achieved by having both receivers 
decode both messages. 

The outer bounds presented in Theorem lIII.il "one auxiliary RV outer bound" and 1111.21 . "BC inspired outer bound", cannot 
be evaluated in general since they include auxiliary RVs whose cardinality has not yet been bounded. In the following we 
propose a new outer bound, looser in general that the outer bound of Theorem 1111.11 without auxiliary RVs. This bound is 
looser than the outer bound of Theorem 1111. ll "one auxiliary RV outer bound", in the general case, but it is tight in the "very 
strong interference" regime. 

IV. A NEW OUTER BOUND 

Theorem IV.l. If {Ri,R2) lies in the capacity region of the DM-CIFC then 

i?i </(n;^i|^2), (9a) 

R2<I{Xi,X2\Y2), (9b) 

Ri + R2 < I{Xi.X2\Y2) + I{Yr,X^\Y^,X2), (9c) 
taken over the union of all distributions pxx.Xz i^^^d pY^y^\Xi,X2' where Y2 has the same marginal distribution as Y2, i.e., 

PY^\Xi,X2 = PY2\Xi,X2- 

The idea behind this outer bound is to exploit the fact that the capacity region only depends on the marginal distributions 
Pyi\Xi,X2 ™d Py2\Xi.X2 because the receivers do not cooperate. 

Proof: By Fano's inequality we have that H{Wi\Y/^) < NeN, for some such that epf as N ^ for i e {1, 2}. 
The rate of user 1 can be bounded as 

NiRi-eN)<I{Wi;Yf) 

<IiWi;Y^''\W2) 

^ I{WuX^{Wi,W2);Y,''\W2, X^{W2 ) ) 

< HiY^''\W2,Xi') - HiY,''\W2,Wi,X^, X^) 

< H{Y^\X^) - H{Yl'\W2.Wi,X^,X^) 
^H{Y,''\X^)-H{Yf\X^,X^) 

N 

= J2h{Yu\X^, (Fi)!-!) - H{Yu\X^,X^, (Fi)!"!) 

i=l 
N 

< Y,H{Yu\X2^) ~ H{Y,^\Xu,X2^) 
1=1 

= NI{Yit;Xit\X2tX} 

= N{H{YiT\X2T,T) - H{Yit\Xit,X2t,T)) 

- N{H{Yit\X2t,T) - H{Yit\Xit,X2t)) 

< N{H{Yit\X2t) ~ H{Yit\Xit,X2t)) 

<I{Yit;Xit\X2t), (10a) 
where T is the time sharing RV, informally distributed over the set {l...A^} and independent on the other RVs. 



6 



The rate of user 2 can be bounded as 

N{R2-eN) <I(Yf;W2) 

<I{Yf;W2,W,) 

= HiYi") - i/(y/|W^i, W^2,Xf (T^2),Xf (T^i, W2)) 
= H{Yr)-H{Yi'\X^,X^) 

N 

= Y.HiY2^\(Y2)\-') H(Y^,\X^,X^, {Y,)\-^) 

i=l 
N 

<Y,H(Y2^) - H{Y2,\Xu,X2^) 

i=l 

<NI{Y2t;Xit,X2t\T) 

= N{H{Y2t\T) - H{Y2t\Xit,X2t,T)) 

< N{H{Y2t) - H{Y2t\XiT.X2t)) 

<I{Y2t;X,t,X2t)- (10b) 

Next let Y2 be any RV such that Py.^\Xi.X2 — 1X1,^2 ^lut with any joint distribution Pyi.Y2\Xi,X2- The sum-rate can 
then be bounded as 

N{Ri + R2- 2NeN) < I{Wi;Yi) + /(W^a; ^"2) 

< I{Wi;Yf\W2) + I{W2;Y2'') 
<I{Wi; Y,^ , Y^^ I W2)+I{W2; if ) 
- /(W^2; K,^) + I{WuY^''\W2) + I{Wi;Y,^\Y^^,W2) 
= H{Y2'') + ( - H{Y2''\W2) + H{YJ,''\W2)] 

- li{Y!2''\W^,W2) + 1/(^1^1^2"^, W2) - H{Yf\Yi'', Wi,W2) 
= H{Y2'') + H{Y,''\W2,X^, K,"^) 

- H(Y^''\W,,W2,X^, X^) - HiY^^'lY^", W^i, W^2, , X^) 

= H{Yi') + H{Y,''\W2.X^X'') 

H{Y2''\X^,X^) - H{Y,^\Yi^,X^, X^) 

<H{Y2'') + H{Yl'\X^,Yr) 

- H{Yi'\X^,X^) - H{Y,^\Y^\X^, X^) 

N 



< /(r2^;xf ,^2^) + J2h{Yu\x^,y^'', (ri)r') - h{Yu\x^',x^,y^'', {y,)\-') 

1=1 

N 

< I{Yf;X^, X^) + ^ H{Yu\X2^, Y^^) - H{Yu\Xu, X2^Xi) 



i=l 
N 



< I{Y2^;X^, X^) + H{Yu\X2^, Y^d - HiYu\Xu, X2^,Y^i) 

1=1 

^ N {I{Y2t;Xit,X2t) + H{Yit\X2t,Y^t,T) - H{Yit\Xit, X2t,Y^t)) 

< N {I{Y2t;Xit,X2t) + I{Yit;Xit\X2t,Y^t)) ■ (10c) 

■ 

Remark IV.2. The outer bound of Theorem II V. 1 1 contains the outer bound of Theorem IIILll "one auxiliary RV outer bound". 
Indeed, for a fixed distribution pxi,X2^ (fTa|) = (|9a)) and ^Tb\i < (|9b|) since 



m =I{Y2;X2,U) 

< 7(^2; ^2, U) + I{Y2;Xi\U, X2) 
= I{Y2;Xi,X2,U) 
= I{Y2;Xi,X2) = 

where the last equality follows from the Markov chain U — Xi,X2 — Yi,Y2. 
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Consider such that PYi\u,Xi,X2 = PY2\u,Xi,X2^ which also implies Py4\u,X2 = PY2\u,X2 since 

= ^ / Py2|(7,-^l:X2P(7,Xi,Jf2'^-^l 



then: 

(Hil =/(r2;^2,f/) + /(Xi;n|C/,^2) 

- H{Y2) + H{Y2\Xi,X2,U) - H{Y2\U,Xi,X2) - H{Y2\U,X2) + I{Xi;Yi\U, X2) 
= I{Y2;Xi,X2, U) + H{Yi\U, Xi, X2) - H{Yi\U, X2) + I{Xi-Yi\U, X2) 

< I{Y2;Xi,X2) - I{Yi- Xi\U, X2) + I{Xi-Yi\U, X2) + I{Yi- Yi\U, Xi, X2) 
= I{Y2;XuX2) - HY^; Xi\U, X2) + I{Y^, Xi:Yi\U, X2) 

= 7(^2 ; Xi , X2) + I{Yi ; Xi 1^2', U, X2) 

= I{Y2 ; Xi , X2) + (Fi 1^2', U,X2)- H{Yi \Yl U, Xi , X2) 

(b) 

< 7(^2; Xi, X2) + i7(ri 112', X2) - H{Yi \Y^, Xi,X2) 
= I{Y2;Xi,X2) + I{Yi;Xi\YlX2) = ©. 

Now the RV U does not appear in the outer bound expression ( l9c] i and thus we can consider simply the RVs with Pp^jXi X2 ~ 
Py2\Xi,X2 which corresponds to the definition of Y2 in Theorem |9] 

Equality of the outer bounds is verified when conditions (a) and (6) hold with equality, that is when 

IiY2;Xi\U,X2)^0 

IiYuXi\Y2,U,X2) ^ I{Yi;Xi\Y2,X2) ^pu, 
for a given Y2. The first conditions implies the Markov Chain (MC) 

Y2-U,X2-Xi 

and the second condition the MC _ 

Yi,Xi^Y2X2-U 

We currently cannot relate these conditions to any specific class of DM-CIFC. 

Remark IV.3. The outer bound of Theorem IIV. 1 1 reduces to the strong interference outer bound in (O, in fact 

/(ri;Xi|X2) < 7(^2; Xi 1X2) Vpxi,X2 

implies 

IiYuXi\Y^,X2) < I{Y2;Xi\Y^,X2) Vpxi,X2,y- 
Now let Y2 = Y2 to obtain that I{Yi; Xi\Y2, X2) ^ yielding = (|9b]) so that the two outer bounds coincide. 

V. A NEW INNER BOUND 

As the DM-CIFC encompasses classical interference, multiple-access and broadcast channels, we expect to see a combination 
of their achievability proving techniques surface in any unified scheme for the CIFC. Our achievability scheme employs the 
following classical techniques: 

• Rate-splitting. We employ a rae-splitting technique similar to that in Han and Kobayashi's scheme of lfT9ll for the interference- 
channel, also employed in the DM-CIFC regions of [31, |[8|, ll20ll . While rate-splitting may be useful in general, is not necessary 
in the very weak |4| and very strong |21| interference regimes of (|2l and ((S]). 

• Superposition-coding. Useful in multiple-access and broadcast channels IITSl . in the DM-CIFC the superposition of private 
messages on top of common ones, as in |8l, f20l, is known to be capacity achieving in very strong interference ll2n . 

• Binning. Gel'fand-Pinsker coding |22J, often simply referred to as binning, allows a transmitter to "cancel" (portions of) the 
interference known to be experienced at a receiver. Binning is also used by Marton in deriving the largest known achievable 
rate region (23\ for the discrete memoryless broadcast channel. 

We now present a new achievable rate region for the DM-CIFC which generalizes all the known achievable rate regions 
presented in H, iH, |[TOl, lEO), El and ll25l . 

Theorem V.l. The "Rrtd region. A rate pair (i?i,i?2) such that 

Ri = Ric + -Ripb, 

i?2 = R2C + R2pa + R2pb- (11) 



g 



Ri = Ric + Ripb 



W2^W2{W2c,W2ph,W2pa 
i?2 = R2c + R2pa + R2pb 



■W: 



2pb 



■Wo 



■W, 



2pd~ 



7-7^ 



'PX2\U2. )- 



PU2pb\Ulc,U2c,X.2 
PU*p,,\Ulc_,U2a 



P. 



■X, 



Fig. 2. The achievability encoding scheme of Thm. IV. 1 1 The ordering from left to right and the distributions demonstrate the codebook generation process. 
The dotted lines indicate binning. We see rate splits are used at both users, private messages Wipj, , W2pa , W^2p6 superimposed on common messages 
Wic,W2c and Uic, is binned against X2 conditioned on U2c, while Uipf, and U2pb are binned against each and X2 other in a Marton-like fashion 
(conditioned on other subsets of RVs). 
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I{Yi 


Ulpb, Ulc, U2c), 




(12i) 




{Ric- 


F R[c) + 


(-Rlpb 4 


" R'lpb) 


< 


HYi 


Ulpb, Uic\U2c), 




(12j) 








(-Ripb 4 


" R'lpb) 


< 


I{Yi 


C^lpb C^lc, U2c), 




(12k) 



/or some input distribution 

PYuY2,Xi,X2,Ula,U2a,U2pa,Uipb,U2pb 



PUl^,U2o,U2pc.,Uipb,U2pb,Xl,X2PYi,Y2\XuX2- 



Remark V.2. Moreover: 

. ( I12db can be dropped when i?2c = i?2pa = -R2pb = -R2pb = 0; 

• ( I12eb can be dropped when i?,2pa = i?2pb = ^2pfc = 0; 

• ( 12gi can be dropped when i?2pb = -R2pb = 0; 

• (I12il l can be dropped when Ric = R[c = Ripb = R'lpb = 0, 

since they correspond to the event that a common message from the non-intended user is incorrectly decoded. This event is 
not an error event if no other intended message is incorrectly decoded. 

Proof: The meaning of the RVs in Theorem IV. II is as follows. Both transmitters perform superposition of two codewords: 
a common one (to be decoded at both decoders) and a private one (to be decoded at the intended decoder only). In particular: 

• Rate Ri is split into Ric and Ripb and conveyed through the RVs Uic and Uipb, respectively. 

• Rate i?2 is split into i?2c, i?2pa and i?2pfc and conveyed through the RVs U2c,X2 and t/2pb, respectively. 

• U2c is the common message of transmitter 2. The subscript "c" stands for "common". 

• X2 is the private message of transmitter 2 to be sent by transmitter 2 only. It superimposed to U2c- The subscript "p" 
stands for "private" and the subscript "a" stands for "alone". 

• Ulc is the common message of transmitter 1. It is superimposed to U2C and - conditioned on U2C - is binned against X2- 

• Ulpb and U2pb are private messages of transmitter 1 and transmitter 2, respectively, and are sent by transmitter 1 only. They 
are binned against one another conditioned on U2C, as in Marton's achievable rate region for the broadcast channel ||23]| . 
The subscript "b" stands for "broadcast". 

• Xi is finally superimposed to all the previous RVs and transmitted over the channel. 

A graphical representation of the encoding scheme of Theorem IV. 1 1 can be found in Figure |2] The formal description of the 
proposed encoding scheme is as follows: 
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A. Rate splitting 

Let Wi and W2 be two independent RVs unifomily distributed on [1...2^^i] and [1...2^^^] respectively. Consider splitting 
the messages as follows: 

W2 = {W2c,W2pb,W2pa), 

where the messages Wi, i £ {Ic, 2c, lpb,2pb,2pa}, are all independent and uniformly distributed on [1...2^^'], so that the 
rate are 

i?l = Ric + Rlpb, 

R2 = R2C + R2pa + R2pb- 



B. Codebook generation 

Consider a distribution pr/i, .r/j, ,x,.!7ip6.c/ip6.Xi,X2- The codebooks are generated as follows: 

• Select uniformly at random 2^^'^" length- sequences U^{w2c), W2c G [1...2^^2c]^ from the typical set T^{pu2c)- 

• For every W2c S [1...2^^^'=], select uniformly at random 2^^^"° length- A?' sequences X^{w2c, W2pa), W2pa G [1...2^^^f<'], 
from the typical set T/^bxa.c/ao |t^2^(«^2c))- 

• For every W2c G [1...2^^^''], select uniformly at random 2^^^^''~^^'^c> length-A^ sequences U^{w2c,wic,bo), wic & 
[1...2^«i<=] and e [1...2^«i<=], from the typical set {pu^^u2AUi^ciw2c)) 

. For every W2c e [1...2"^2c]^ ^^^^pa G [1...2^^2pa]^ ^ [1...2^^i<=] and 5o € [1...2^^'k=], select uniformly at random 
2N(R2pk+R'2p,) length-TV sequences Uii^{w2c,W2pa,wie,bo,W2pb,b2), W2pb G [1...2'^^^'"'] and 62 S [1...2^-"^p6], from 
the typical set T^iPu2pt,U2o,Uio,X2 \U2iiw2c), X^{w2c, W2pa), U^^{w2c-, wu, bo)). 

. For every W2c G [1 . . .2^ ^'"'], wic G [1...2^-^i-] and 60 G [1...2^^'k=], select uniformly at random 2^^^^^''+^'^p^'> length-A/' 
sequences U^p^{w2c,wic,bQ,wipb,bi), wipb G [1...2^^i!'''] and 61 G [1...2^^^v^\, from the typical set 

T^{PU^^^,U2.,UiAU^ciw2c),U^^{w2c,Wu,bo))- 

. For every W2c G [1...2^-«2<=], W2pa G [1...2^«2'«'], wi^ G [1...2^-"i-], 60 G [1 : 2^«'k=], wipb G [1...2^«ip''], 5i G [1 : 
2^-Ripi>], W2pb G [1...2^-«2p''], 62 G [1 : 2^-^2pi'], let the channel input {w2pa,W2c,wieM,w\pbM,W2pbM) be any 
length- A/^ sequence from the typical set 

Ti^iPXi,U2c,Uic,X2,U2pb,Uiph \U2ciw2c), X^{W2c, W2pa), U^d'^^c, Wlc, bo), U^pb{W2c, W2pa, Wlc, bo, W2pb, ^2), 

U^pb{w2c, Wlc, bo, Wipb, bi)). 



C. Encoding 

Given the message W2 = {w2c,W2pb,W2pa), encoder 2 sends the codeword {■W2c,W2pa)- 

Given the message W2 = {w2c, W2pb, W2pa) and the message w\ = (wic, w\pb), encoder 1 looks for a triplet {po, &i, ^2) such 
that: 

{U2c{W2c), X^{w2c, W2pa), C^l^('U'2c, Wu, bo), U^pb{W2c, Wu, bo, Wipb, bl),U2pb{W2c, Wu, bo, W2pb, ^2)) 

G iPU2c,X2,Ulc,Uipb,U2pb)- 

If no such triplet exists, it sets (60, ^2) = (1,1,1)- If more than one such triplet exists, it picks one uniformly at random 
from the found ones. For the selected (&o,&i,&2), encoder 1 sends X(^ {'W2pa,W2c,Wlc,bo,Wlpb,bl,W2pb,b2)■ 
Since the codebooks are generated iid according to 



(codebook) 

P - PU2c PX2\U2,: Plh,:\U2c PU 2pb\U 2c,U ,X2 PU ipb\U2c -.Ul^ 

but the encoding forces the actual transmitted codewords to look as if they were generated iid according to 

p(encoding) ^ ^^^^ PX2\U2a PUla\U2c,X2 PU2pb\U2c,Ulc,X2 PUlpb\U2c,Ulc,X2,U2pb' 

We expect the probability of encoding error to depend on 



(13) 



(14) 



E 



P 



(encoding) 



p(codebook) 



E 



PUla\U2a,X2 PUipb\U2a,Ula,X2,U2pb 



PUlc\U2cPUlpb\U2c,Ul^ 



= I{Uu;X2\U2c) + I{Uipb;X2,U2pb\U2c,Uu). 
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D. Decoding 

Decoder 2 looks for a unique tuple {w2c^ W2pa, W2pb) and some {wic, bo, 62) such that 

{U2c{w2c),X2{w2c,W2pa),U"^{w2c,Wic,bo),U2pb{w2c,Wic,bo,W2pb,b2),Y^) G bc/2c ,^2 ,(7lc ,;72pb , V2 ) ■ 

Depending on which messages are wrongly decoded at decoder 2, the transmitted sequences and the received are 
generated iid according to 



P2\* - PU2c PX2\U2a PUla\U2a P(72p6 | (72c , C/lc ,^2 | * : 



(15) 



where indicates the messages decoded correctly. However, the actual transmitted sequences and the received considered 
at decoder 2 look as if they were generated iid according to 



P2 - PU2a PX2\U2a -P(7l c | (72o , ^2 PU2pb\U2a -.Ul^ ,X2 P Y2 | £/2c , (7l e , ^2 , ^2^6 

Hence we expect the probability of error at decoder 2 to depend on terms of the type 

. PUla\U2a:X2 -P^b | (72c , t^l X2 , (72^6 



1 P2 

log 


^ E 


P2\* 





log- 



/([/le; X2\U2c) + I{Y2; U2c, f/lc, ^2, U2pb\*) ■ 



(16) 



(17) 



PUl,\U2. PY2\* 

Decoder 1 looks for a unique pair (wicWipb) and some {w2c,bo,bi) such that 

{U2c{w2c), U"^{w2c, Wic, bo), U"p,,{w2c, Wic, bo, Wipb, bi), Y") G T"{pu2^,Uia,Uipi.Yi)- 

Depending on which messages are wrongly decoded at decoder 1, the transmitted sequences and the received Y{^ are 
generated iid according to 



Pl\* = PU2aPUla\U2^PUlpt\U2a,UlaPYi\*, 



(18) 



where "★" indicates the messages decoded correctly. However, the actual transmitted sequences and the received considered 
at decoder 1 look as if they were generated iid according to 



Pi - PU2c PUl^\U2a PUipt\U2a,Ula PYi\U2a .Ula -.Ulpb ' 

Hence we expect the probability of error at decoder 1 to depend on terms of the type 

Pm(72c:(7lc:(7l„6 



1 p^ 

log 


= E 


Pl\* 





The error analysis is found in Appendix lAl 



log- 



/(yi;C/2c,C/ic,(7ipb|*). 



(19) 



(20) 



E. Two step binning 

It is also possible to perform binning in a sequential manner First, Uic is binned against Xi, and then Uipb and U2pb are 
binned against each other conditioned on {U2c,Uic) and {U2c, X2,Uic) respectively. With respect to the encoding operation 
of the previous section, this affects Section IV-C I as follows: 

Given the message W2 = {w2c,W2pb,W2pa) and the message wi — {wic,wipb), encoder 1 looks for bo such that 

{U2c{w2c),X2 (W2c, W2pa), U^^{w2c, Wic, bo), 
G 'E^{PU2a,X2,UiJ- 

If no such bo exists, it sets bo = 1. If more than one such bo exists, it picks one uniformly at random. For the selected bo, 
encoder 1 looks for (61,62) such that: 

{U2c{w2c),X2 {W2c, W2pa), U^^{w2c, Wic, bo) , U{^p^{w2c, Wic, bo, Wipb, bi) , U2pb{w2c, ^ic, 60, W2pb, 62)) 

e iPU2a,X2,Ula,Uipb,U2pb)- 

If no such (61, 62) exists, it sets (61, 62) — (1, 1). If more than one such (61, 62) exists, it picks one uniformly at random from 
the found ones. 

For the selected (60,61,62), encoder 1 sends X^ {w2pa,'W2c,'Wlc,bo,Wlpb,bl,W2pb,b2)■ 
The next lemma states the condition under which this two step encoding procedure is successful with high probabiUty. 



Lemma V.3. The two-step binning encoding procedure of Section \V-E\ is successful if 

R'lc > I{Ulc;X2\U2c), 
R'lpb ^ I{Ulpb', X2\U2c,Uic), 
^'ipf) + ^2p6 — I{Ulpb]X2,U2pb\U2c,Uic). 



(21a) 
(21b) 
(21c) 
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The proof of the lemma is found in Appendix (|E|. 

Remark V.4. Since the binning rate ( I12al ) of Theorem IV. II can be taken with equahty, the two step binning has the same 
performance as joint binning. In fact, by setting (|21ap to hold with equality, we obtain the equality between the binning rate 
expression of the joint binning and the two step binning. 

A plot of the permissible binning rates Ripb and i?2ph is depicted in Figure [3] 




Fig. 3. The region of tlie admissible binning rates Ripb and R2pb in Theorem IV. 1 1 



VI. Comparison with existing achievable rate regions 

We now show that the region of Theorem IV. II contains all other known achievable rate regions for the DM-CIFC. Showing 
inclusion of the rate regions ll26l Thm.2], ll24i Thm. 1] and ll25l Thm. 4.1] is sufficient to demonstrate the largest known 
DM-CIFC region, since the region of Il26l Thm.2] (first presented in IfTQ) is shown (in |26|) to contain those of |8 Thm. 1] 
and Il20l . 

A. Devroye et al.'s region l\24\ Thm. 1 ] 

In Appendix |F] we show that the region of ||24l Thm. 1] "Romt, is contained in our new region JIrtd along the lines: 

• We make a correspondence between the random variables and corresponding rates of "Rdmt and ^btd- 

• We define new regions "JIdmt ^ 3?™^/^ and 'X^^j-jj C JIbtd which are easier to compare: they have identical input 
distribution decompositions and similar rate equations. 

• For any fixed input distribution, an equation-by-equation comparison leads to "Rdait ^ '^'dmt — -^^rtd — "^RTD- 

B. Cao and Chen's region Ii26\ Thm. 2] 

The region in p26], Thm. 2] uses a similar encoding structure as that of "Rrtd with two exceptions: 

1) The binning is done sequentially rather than jointly as in JIrtd leading to binning constraints (43)-(45) in ||251 Thm. 
2] as opposed to ( I12al i-( ll2cl i in Thm lV.ll Notable is that both schemes have adopted a Marton-like binning scheme at the 
cognitive transmitter, as first introduced in the context of the CIFC in lilOi . 

2) While the cognitive messages are rate-split in identical fashions, the primary message is split into 2 parts in ll26l Thm. 
2] (Ri = Rii + i?io, note the reversal of indices) while we explicitly split the primary message into three parts i?2 = 
R2c + R2pa + R2pb- In Appendix iGl we show that the region of ll26l Thm.2], denoted as 3?cc Q ^rtd in two steps: 

• We first show that we may WLOG set Un = in Il26l Thm.2], creating a new region Rqq. 

• We next make a correspondence between our RVs and those of ll26l Thm.2] and obtain identical regions. 
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C. Jiang et al.'s region 4251 Thm. 4.1] 

The scheme originally designed for the more general broadcast channel with cognitive relays (or interference-chanel with a 
cognitive relay) may be tailored/reduced to derive a region for the cognitive interference channel. This scheme also incorporates 
a broadcasting strategy. However, the common messages are created independently instead of having the common message 
from transmitter 1 superposed to the common message from transmitter 2. The former choice introduces more rate constraints 
than the latter and allows us to show inclusion in "Rbtd after equating random variables. The proof of the containment of the 
achievable region of ll25l Thm. 4.1] in 'JIrtd is found in Appendix iHl 

VIL New capacity results for the DM-CIFC 

We now look at the expression of the outer bound |4 Thm. 3.1] to gain insight into potentially capacity achieving achievable 
schemes. In particular we look at the expression of the corner points of the outer bound region for a fixed pu, X1.X2 ™d try 
to interpret the RVs as private and common messages to be decoded at the transmitter side. We then consider an achievable 
scheme inspired by these observations and show that schemes achieve capacity for a particular class of channels. This class 
of channels contains the "very strong" and the "very weak" interference regimes and thus corresponds to the largest class of 
channels for which capacity is currently known. 

The outer bound region of l|4j Thm. 3.1] has at most two corner points where both Ri and R2 are non zero: 

(i?r* ^"\rT *"^) = iHY,;X,\U,X,),I{Y2;U,X2)) (22) 
(i^r* ^'\i?r ^'^) = {I{Yi;Xi\U,X2)+I{Y2;U,X2)~A,A) (23) 

A ^ [IiY2;U,X2) - I{Yi;U\X2)] + , 

since 

=mm{I{Y2;U,X2),I{Y2;U,X2) + I{Yi;X,\U,X2)} = IiY2;U,X2), 
^mm{I{Yi;Xi\U,X2),I{Yi;Xi\X2)}^I{Yi;Xi\U,X2), 

and 

= mm{IiY2;U,X2),IiY2:U,X2) + IiYi;Xi\U,X2) - I{Yi;Xi\X2)} 

= [I(Y2; U, X2) + min{0, /(Fi; Xi|C/, X2) - /(Yi; Xi, U\X2)}] + 

= [I{Y2; U, X2) - I{Yi; U\X2)]+ = A, 

<mm{I{Yi;Xi\X2)J{Y2;U,X2)+I{Y,;Xi\U,X2)} 

= I{Yi;Xi\U,X2) + I{Y2; U,X2) - max{/(y2; U,X2) - I{Yi- U\X2)M 

= I{Yi-Xi\U,X2) + I{Y2;U,X2) - A. 

Proving the achievability of both these corner points for any pu.X1.X2 shows capacity by a simple time sharing argument. 

We can now look at the corner point expression and try to draw some intuition on the achievable schemes that can possibly 
achieve these rates. For the corner point iJj"^) we can interpret (J7, X2) as a common message from transmitter 2 to 

receiver 2 that is also decoded at receiver 1. Xi is superposed to ([/, X2) since the decoding of Xi follows the one of (?7, X2) 
at decoder 2. 

The corner point ^^\r'^^ '■'''') has two possible expressions: 

1) If I{Yi-U\X2) < I{Y2; U, X2) we have that 

(i?r* ^''^\rT "'^') = {I{Yi:X,,U\X2),I{Y2-,U,X2) - I{Y,-U\X2)) , (24) 

which suggests that X2 is again the common primary message and the cognitive message is divided into a public and private 
part, U and Xi respectively. 

2) If I{Yi-U\X2) > I{Y2\ U, X2) we have that 

^''^") = inY2;U,X2) + IiY,;X,,U\X2),0) . (25) 

In this case the outer bound has only one corner point where both rates are non zero. Note that we can always achieve the 
point 

{RT ,Rl- ^'^") = {I{Y,;X,,U\X2),0) 
by having transmitter 2 send a known signal. In this case we have i?2"* ^''^ — -R™ ''''' ^^'^ < i?™ '"''^ since 

/(Yi; Xi, U\X2) > /(I2; U, X2) + I{Yi-Xi, U\X2) 
I{Y,;U\X2)> I{Y2;U,X2). 

So in this case showing the achievability of the point in equation ( |23] | is sufficient to show capacity. 

Guided by these observations, we consider a scheme that has only the components U2C, C/ic and Uipb- That is, the primary 
message 0J2 is common and the cognitive message wi is split into a private and a public message. With this scheme we are able 
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to extend the capacity results in the "very weak interference" of Theorem IIII.5I and the "very strong interference" of Theorem 
nil. 61 This scheme coincides with the scheme of (TT\ which achieves capacity if the cognitive receiver is required to decode 
both messages (with and without the secrecy constraint). 

Theorem VII.l. Capacity in the "better cognitive decoding" regime. 

When the following condition holds 

IiYi;X2,U)>I{Y2;X2,U) ypx^,x,,u, (26) 
the capacity region of the DM-CIFC is given by region in ([T]l. 

Proof: Consider the achievable rate region of Theorem IV. II when setting 



Xi — Uipb 

X2 = U2C = U2pb 

SO that 

i?2 = R2c 

Rlpa = R2pb = 

^Ic = ^Ipb — ^2pb — ^■ 

In the resulting scheme, the message from transmitter 2 to receiver 2 is all common while the message from transmitter 1 to 
receiver 1 is split into common and private parts. The achievable region of this sub-scheme is: 

R2+R1C < I{Y2;Ui,,X2) (27a) 

R2+Ric + Ripb < I{Yi;Ui„X2) (27b) 

Ric + Ripb < I{Yi-Uic,Xi\U2c) (27c) 

Ripb < /(Yi;Xi|X2,{/ic). (27d) 
By applying Fourier-Motzkin elimination |28| we obtain the achievable rate region 

Ri < I{Yi;Uic,Xi\X2) (28a) 

R2 < I{Y2;Uic,X2) (28b) 

R1+R2 < I{Y2;Uic,X2)+I{Yi;Xi\X2,Ui,) (28c) 

R1+R2 < I{Yi;X2,U,c,Xi). (28d) 

By letting Uic = U we see that ( [Tal l matches ( I28al l. ([Tbl l matches (I28bl l. ([Tel l matches (I28cl l. and (I28db is redundant when 

I{Yi;X2,Xi,U) > IiY2;U,X2) + I{Yi;Xi\X2,U), 

or equivalently when 

I{Yi;U,X2)>IiY2;U,X2). (29) 



We term the condition in equation ( I29I I "better cognitive decoding" since decoder 1 has a higher mutual information between 
its received channel output and the RVs U and X2 than the primary receiver 

Remark VII. 2. The "better cognitive decoding" in ( |29] | is looser than both the "very weak interference" condition of ^ and 
the "very strong interference" condition of ([8]). In fact summing the two equations of condition O we have 

I{U;Yi\X2) + IiX2;Yi) > I{U;Y2\X2) + IiX2;Y2) <^ IiYuU,X2) > IiY2;U,X2) 

which corresponds to condition ( |29] |. Similarly by summing the two equation of condition (O we obtain 

IiYi;Xi,X2) + I{Xi;Y2\X2) > 7(^2 ; ^1, ^2) + /(^i; 1^2) ^ 

I{Yi;Xi,X2)-I{Xi;Yi\X2) > /(llj; ^1, ^2) - /(^i; i"2|^2) 

I{Yi;Xi,X2,U)~IiXi;Yi\X2) > /(l^j; ^1, ^2, C/) - /(Xi; ^21^2) ^ 

IiYi;X2,U) > I{Y2;X2,U) 

which again corresponds to condition ( |29] |. 
Since both (|7]i and (O imply the ( l29b . we conclude that ( |29] | is more general than the previous two. 

The scheme that achieves capacity in very weak interference is obtained by setting Uic — X2 so that all the cognitive message 
is private and the primary message is common. The scheme that achieves capacity in very strong interference is obtained by 
setting Uic = Xi so that both transmitters send only public messages. The scheme that we use to show the achievability in 
the "strong cognitive decoding" regime mixes these two schemes by splitting the cognitive message into public and private 
messages. This relaxes the strong interference achievability conditions as now the cognitive encoder needs to decode only part 
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of the cognitive message. The scheme also relaxes the very weak achievability condition since it allows the cognitive encoder 
to decode part of the cognitive message and remove its unwanted effects. For this reason, the resulting achievability conditions 
are looser than both cases. 



VIII. Capacity for the semi-deterministic CIFC 

Consider the specific class of DM-CIFC for which the signal received at receiver 1 is a deterministic function of the channel 
inputs, that is 

Fi = /i(Xi,X2). (30) 

This class of channels is termed semi-deterministic CIFC and it was first introduced in |26|. In |26| the capacity region is 
derived for the case I{Yi;X2) > I{Y2;X2)', we extend this result by determining the capacity region in general (no extra 
conditions). Note that the authors of [26] consider the case where /i is invertible; we do not require this condition. 

Theorem VIII.l. The capacity region of the semi- deterministic cognitive interference channel such that ^301 ) consists of all 
(i?i,i?2) e R+ such that 

Ri < H{Yi\X2) (31a) 
R2 < I{Y2;U,X2) (31b) 
R1+R2 < I{Y2;U,X2)+H{Yi\U,X2) (31c) 

taken over the union of all distributions pu,Xi,X2- 

Proof: Outer bound: The outer bound is obtained from Theorem IIII.ll "one auxiliary RV outer bound" , by using the 
deterministic condition in ( l30b . 

Achievability: Consider the scheme with only the RVs X2, Uipb and U2pb, obtained by setting U2C — Uic — ^- The achievable 
rate region of Theorem IV. 1 1 becomes: 

R'lpb > I{Uipb\X2) (32a) 

R'lpb + R'lpb ^ IiUlpb',U2pb, X2) (32b) 
Rlpa + Rlpb + R'lpb ^ lO^l'j U2pbi X2) (32c) 

Rlpb + R'lpb < HYi;U2pb\X2) (32d) 
Rlpb + R'lpb < I{Yi;Uipb), (32e) 

taken over the union of all input distributions PUipfV2pb,XuX2PYi,Y2\Xi,X2- 

From the Fourier Motzkin elimination of this sub-scheme, we see that we can set i?2p6 — without loss of generaUty and 
that the region can be rewritten as 

MUipb,U2pb,X2) = { Ri <I{Yi-Uipb)-I{Uipb;X2) (33a) 

Ri < I{Y2; U2pb\X2) - HUipb-.. U2pb\X2) + I{Yi-Uipb) - I{Uipb;X2) (33b) 

Ri <I{Y2;U2pb,X2) (33c) 

Ri+Ri <I(Y2;U2pb,X2)+I(Yi;Uipb)-I{Uipb;U2pb,X2)} (33d) 

taken over the union of all distributions that factor as 



Let 



and 



PUlpb,U2pb:Xi,X2PYuY2\XuX2- (34) 

^i{Uipb,U2pb,X2) = { Ri <I{Yi;Uipb)-I{Uipb;X2) (35a) 

Ri <I{Y2;U2pb,X2) (35b) 

Ri+Ri <I{Y2;U2pb,X2)+I{Yi;Uipb)-I{Uipb;U2pb,X2)} (35c) 

MUipb,X2)^{ Ri <I{Yi-Uipb)~I{Uipb;X2) (36a) 

Ri <I{Y2;X2)]. (36b) 



Notice that 
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Since 

^2{Ulpb, X2) = 'Jil{Uipb, U2pb = X2, X2) = 'JioiUlpb, U2pb ~ X2,X2), 

and 3io(t^ipb, U2pb,X2) has one less constraint than "RiiUipb, U2pb,X2)- 
We now wish to show that 

that is, equation ( I33bl i can be removed from the Fourier Motzkin eliminated region of (|32] |. The proof of this equivalence 
follows that of 1291 Lemma 2]. For P{Uipb, U2pb,X2) such that 

HY2; U2pb\X2) - HUipb-, U2pb\X2) > 

we have 

"^liUlpb, U2pb, X2) — 'Ro{Uipb, U2pb, X2). 

For those P{Uipb, U2pb, X2) such that 

HY2; U2pb\X2) - HUipb-, U2pb\X2) < 

we have that the point 

(i?i,i?2) = iI{Y,;U,pb)-I{Uipb;X2)),I{Y2;X2)) 

is achievable in 312- This point lies inside "Ri and !Ro and satisfies all the rate constraints in (l33T l but (I33bl l. In particular, the 
sum rate equation ( I33dl ) given by 

Ri+R2< I{Y2; U2pb, X2) + I(Yi;Uipb) - I{Uipb; U2pb, ^2), 

which implies 



R2<I{Y2;X; 



2 



R2 < I{Y2; U2pb, X2) + I{Yi-Uipb) - I{Uipb-, U2pb, X2) - Ri 

= 1(^2; ^2) + I{Y2] U2pb\X2) - liUipb-, U2pb\X2) 
<I{Y2\X2). 

Using time sharing we can show the achievability of the whole region 'Ri n 3?o, which means that the rate points that are not 
in 3?o(C^ip6j C^2p6, -^^2) are in 3?2(t^ipb, ^2)- But since 3^2(C^ipfc, -''^2) is special case of 'Ro{Uipb,U2pb, X2), we conclude that 

RliUipb, U2pb, X2) — 'JiQ{Uipb, U2pb, X2). 

This means is that decoder 2 must not decode U2pb if that imposes a more stringent rate constraint than the decoding of Uipb 
at the intended decoder 1. For this reason U2pb can be chosen so that U2pb — X2 without loss of generality. This shows that 
!Ki is achievable and thus concludes the achievability proof. 

■ 

Remark VIII.2. The achievable scheme of equation ( [32] l cannot be obtained as a special case of any previously known achievable 
scheme but f25l. The RV U2pb, which broadcasts the private primary message from transmitter 1, appears in f2E{ as well. In 
this scheme though is not possible to reobtain the scheme of equation (|32] | with a specific choice of the RVs since the same 
message W2pa is transmitted in U2pb and the private primary message X2. 

IX. Capacity for the deterministic CIFC 
In the deterministic CIFC both outputs are deterministic functions of the channel inputs, that is 

Fi = ri(Xi,X2) 

Y2^Y2{Xi,X2). (37) 

This class of channels is a subclass of the semi-deterministic CIFC of Section IVIIII and we already have obtained the capacity 
region for this case. However, we re-derive the capacity region in a new fashion for this channel, which illustrates two new 
ideas: 

1) We show the achievability of the outer bound of Theorem II V. 1 1 when letting — Y2, instead of the outer bound of 
Theorem nil. 11 "one auxiliary RV outer bound". 

2) We show achievability of this outer bound using a single unified scheme. 
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Xi G {0,1,2,3} 



G {0,1,2,3} 



Tx 1 



Rx 1 



{2,3} 



Vi 



Tx 2 



X2 G {0 .. 7} 




G {0 .. 7} 



Fig. 4. The "asymmetric clipper" of Section IX-AI 



Theorem IX.l. The capacity region of the deterministic cognitive interference channel consists of all (i?i,i?2) G such 
that 

Ri < H{Yi\X2) (38a) 

i?2 < H{Y2) (38b) 

R1+R2 < H{Y2)+H{Yi\Y2,X2) (38c) 

taken over the union of all distributions pxi,X2- 

Proof: Outer bound: The outer bound is obtained from Theorem II V. 1 1 using the deterministic conditions in ( l37b . 
Achievability: Consider the scheme in ( l35T l and let Uipb — Yi, U2pb = Y2 to achieve the region 

Ri < H{Yi\X2) (39a) 
R2 < H{Y2) (39b) 
R1+R2 < H{Y2;U,X2) + H{Yi\Y2,X2) (39c) 

which corresponds to the outer bound in ( |38] |. ■ 

X. Examples 

The scheme that achieves capacity in the deterministic and semi-deterministic CIFC uses the RV U2pb to perform Gel'fand 
Pinsker binning to achieve the most general distribution among (^2, C^ipb, C^2p&), but interestingly, carries no message. This 
feature of the capacity achieving scheme does not provide a clear intuition on the role of the RV U2pb- For this reason we 
present two examples of deterministic channels where the encoders can choose their respective codebooks in a way that allows 
binning of the interference without rate splitting. To make these examples more interesting we choose them so that they do 
not fall into the category of the "very strong interference regime" of Theorem 1111.61 that in the deterministic case reduces to 

H{Yi\X2) < H{Y2\X2) 

H{Y2) < H{Yi) Vpxi,x.. (40) 

Unfortunately, checking for the "very weak interference condition" of Theorem IIII.5I is not possible as no cardinaUty bounds 
on U are available. 



A. Example I: the "Asymmetric Clipper" 

Consider the channel in Fig.|4] The input and output alphabets are "Xi — — {0, 1, 2, 3} and X2 = ^2 = {0, 1, 2, 3, 4, 5, 6, 7} 
and the input/output relationships are 

Yi^Xi®iX2, (41) 

= l{2,3}(^l)®8+^2, (42) 

where 1a{x) — \ if x G A and zero otherwise, and ®m denotes the addition operation over the Galois field Sjv defined as 
the modulo sum over elements in the finite field {l...A^}. Also let U(S) be the uniform distribution over the set §. 
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First we show that the channel in (l42l l does not fall in the "very strong interference" class. 
Consider the input distribution: 

^2 -lt(X2). 

For this input distribution, we have Yi ^ U(yi) and Y2 ^ 11(^2), so that 

H{Y2) - log(|y2|) = 3 > 2 = log(iyil) = H{Y,) 

which does not satisfy the "very strong interference" condition of ( l40b . 
For this channel we have: 

//(ri|X2)<i/(Yi)<iog(|yi|) = 2 
if(>2)<iog(|y2|) = 3 

HiY,\X2,Y2) < i?(Xi|l{2,3}(^l)) < 1- 

where the last bound follows from the multiplicity of the solutions of an addition in a Galois field. This shows that the outer 
bound in Theorem IIX.II is included in 

Ri < 2 (43a) 
i?2 < 3 (43b) 
R1+R2 < 4. (43c) 

We now show that the region in ( |43] ) indeed corresponds to the Theorem IIX.II when considering the union over all input 
distributions. The corner point {Ri,R2) — (1,3) in ( |43] | is obtained in Theorem IIX. 1 1 with the input distribution: 

^U({0,1}) 

^2 -U(X2). 

The corner point (_Ri,i?2) = (2,2) in (l43l l is obtained in Theorem IIX. 1 1 bv considering the input distribution: 

Xi - U(Xi) 

X2 U(X2). 

Time sharing shows that the region of ( l43T l and the region of Theorem IIX. 1 1 indeed coincide. 
We next show the achievability of the corner point {Ri, R2) ~ (1, 3). Consider the following strategy: 

> transmitter 2 sends symbols from X2 = {0...7} with uniform probability, 

> transmitter 1 transmits [a:i — a;2]2 (where the inverse of the difference operation is taken over the ring 92); 

> receiver 1 decodes wi — [^J; 

> receiver 2 decodes W2 — y2- 

It can be verified by inspection of TableUthat the rate pair (i?i, R2) = (1, 3) is indeed achievable. 

TABLE I 

Achievability FOR {Ri,R2) = (1, 3) in Example I in Section Ix-AI 







W2 


XI 


X2 
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y2 
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10 
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12 




4 


1 


4 





1 


4 


1 


13 




5 
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14 
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1 


15 
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Now we show the achievability of the corner point (i?i, R2) = (2, 2). Consider the following strategy: 

• transmitter 2 sends symbols from X2 G {0, 2, 4, 6} with uniform probability; 

• transmitter 1 transmits [a;i — X2]4 (where the inverse of the difference operation is taken over the ring 94); 

• receiver 1 decodes wi — yi', 
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TABLE II 

ACHIEVABILITY TABLE FOR THE RATE POINT (-Rl, if,2) = (2, 2) IN EXAMPLE I IN SECTIOn IX-AI 







iU2 
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y2 
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Xl e {0,1,2,3} 



Yi e {0,1} 



Tx 1 




1{1,2} 







Rx 1 



40:1} 



Vi 



Tx 2 



^ 1/1 9 



{1.2} Vn 




Rx 2 



X2 e {0,1,2} 



e {0,1,2,3} 



Fig. 5. "Symmetric Clipper" of Section IX-BI 



• receiver 2 decodes W2 — [^\- 
It can be verified by the inspection of Table HIl that the rate pair R2) — (2, 2) is indeed achievable. 

In this example we see how the two senders jointly design the codebook to achieve the outer bound and in particular how 
the cognitive transmitter 1 adapts its strategy to the transmissions from the primary pair so as to avoid interfering with it. 

In achieving the point (i?i,i?2) — (1,3), transmitter 2's strategy is that of a point to point channel. Transmitter 1 chooses 
its codewords so as not to interfere with the primary transmission. Only two codewords do not interfere: it alternatively picks 
one of these two codewords to produce the desired channel output. For example, when the primary message is sending W2 = 
(line and 8 in Table lUi transmitter 1 can send either 1 or 2 without creating interference at receiver 2. On the other hand, 
these two values produce a different output at receiver 1, allowing the transmission of 1 bit. 

In achieving the point {R1. R2) — (2,2), the primary receiver picks its codewords so as to tolerate 1 unit of interference. 
Transmitter 1 again chooses its input codewords in order to create at most 1 unit of interference at the primary decoder. 
By adapting its transmission to the primary symbol, the cognitive transmitter is able to always find four such codewords. It 
is interesting to notice the tension at transmitter 1 between the interference it creates at the primary decoder and its own 
rate. There is an optimal trade off between these two quantities that is achieved by carefully picking the codewords at the 
primary transmitter. For example, when the primary receiver is sending L02 — Q (lines 0, 4, 8 and 12), transmitter 1 can send 
Xl G {0, 1, 2, 3} and create at most 1 bit of interference at receiver 2. Each of these four values produces a different output 
at receiver 1], thus allowing the transmission of 2 bits. 

B. Example II: the "Symmetric Clipper" 

Consider the now channel in Fig. |5] The channel input and output alphabets are Xi = {0,1,2,3} = ^2, "^2 G {0,1,2}, and 
^1 = {0, 1}. The input/output relationships are: 

Yl = l{l,2}(Xl)©2l{1.2}(X2) 
Y2 - l{0,l}(Xl)©X2 
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TABLE III 

The input distribution for Example 11 



X2 Xi 
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1/8 


1/8 
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1/8 


1/8 
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3/8 


1/8 


1/8 





TABLE IV 

ACHIEVABILITY TABLE FOR THE RATE POINT {Rl,R2) = (1,2) IN EXAMPLE II. 
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Consider the input distribution: Consider the input distribution: 

P[Xi - 3] = 1, 

X2^1[({1,2}), 

in this case H{Yi) ~ and H{Y2) = 1. This shows that there exists at least one input distribution for which H{Y2) > H{Yi) 
and thus this channel is not in the "very strong interference" regime. The outer bound of Theorem IIX. II is achieved here by a 
single input distribution pxi,X2- consider the distribution in Table Hill This distribution produces H{Yi) = 1 = log2(|yi|) and 
H{Y2) = 2 = log(|y2|) and clearly no larger outer bound can exist given the output cardinality. We therefore conclude that 
the region of Theorem IIX. 1 1 can be rewritten as: 

i?i < 1 
i?2 < 2. 

This region can be shown to be achievable using the transmission scheme described in Table HV] The decoding is simply 
Cji = Yi, i € {1,2}. This transmission scheme achieves the proposed outer bound, thus showing capacity. The transmission 
scheme can be described as follows: 

• encoder 2 transmits [x2 — 1]^; 

• encoder 1 transmits the value Xi that simultaneously makes Yi = oji and Y2 =002- For each loi and uj2 such a value 
always exists because X2 takes on only three possible values; 

• receivers 1 and 2 decode Cji ~ Yi and 0)2 — Y2- 

This example is particularly interesting since both decoders obtain the transmitted symbol without suffering any interference 
from the other user Here cognition allows the simultaneous cancelation of the interference at both decoders. Encoder 2 has 
only three codewords and relies on transmitter 1 to achieve its full rate of R2 = 2. In fact encoder 1 is able to design its 
codebook to transmit two codewords for its decoder and still contribute to the rate of primary user by making the codewords 
corresponding to L02 — {2,3} distinguishable at the cognitive decoder 

This feature of the capacity achieving scheme is intriguing: the primary transmitter needs the support of the cognitive 
transmitter to achieve R2 — 2 since its input alphabet has cardinality three. The transmitters optimally design their codebooks 
so to make the effect Xi on both outputs the desired one. 

For example consider the transmission of a;2 = 2 or 3 (lines 2,3,6 and 7). In this case transmitter 1 sends a;i = or 
Xl = 1 to simultaneously influence both channel outputs so that both decoders receive the desired symbols. This simultaneous 
cancelation is possible due to the channel's deterministic nature and the extra message knowledge at the cognitive transmitter. 

XI. Conclusion 

In this paper we focused on the discrete memoryless cognitive interference channel and derived new inner and outer bounds, 
derived the capacity region for a class of "better cognitive decoding" channels, and obtained the capacity region for the 
semi-deterministic cognitive interference channel. We proposed a new outer bound using an idea originally devised for the 
broadcast channel in (30). This outer bound does not involve auxiliary RVs and is thus more easily computable. Our outer 
bound is in general looser than the outer bound in |4| and they coincide in the "strong interference" regime of |6|. We also 
proposed a new inner bound that generalizes all other known achievable rate regions. In particular we showed the inclusion 
of the region of ll24l . ||3TI : it was previously unclear how the performance of the scheme in It24il . Ii31i compared with that 
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of Other achievable rate regions. We determined capacity for a class of channels that we term the "better cognitive decoding" 
regime. The conditions defining this regime are looser than the "very weak interference condition" of f4l and the "very strong 
interference condition" of [6| and is the largest region where capacity is known. We also determined the capacity region for the 
class of semi-deterministic cognitive interference channels where the output at the cognitive receiver is a deterministic function 
of the channel inputs. Furthermore, for channels where both outputs are deterministic functions of the inputs, we showed the 
achievability of our new outer bound. This result shows that our outer bound, even though looser than the outer bound in 
is tight for certain channels. The scheme that achieves capacity in the deterministic cognitive interference channel uses 
Gelf'and-Pinsker binning against the interference created at the primary receiver This binning is performed by the cognitive 
encoder for the cognitive decoder. This feature of the transmission scheme was never known before to be capacity achieving. 
We conclude the paper by presenting two examples that show new interesting features of the capacity achieving scheme in the 
deterministic cognitive interference channel. Extensions of the results presented here to Gaussian channels will be presented 
in L32J. 
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Appendix 

A. Error analysis of the achievable region Urtd of Theorem W.l\ 

Without loss of generality assume that the message (wic, W2c, W2pa,wipi,,W2pb) — was sent and let (&0 7^i7^2) be 

the tuple {bo, bi, 62) chosen at encoder 1. Let {wic, W2c, W2pa, 'W2pb, bo, 62) be the estimate at the decoder 2 and {wic, W2c, wipb, &o 
be the estimate at the decoder 1. 

The probability of error at decoder u, u G {1,2}, is bounded by 

P [error u] < P [error m| encoding successful] + ^[encoding NOT successful]. 

An encoding error occurs if encoder 1 is not able to find a tuple (60,61,62) that guarantees typicality. A decoding error is 
committed at decoder 1 when (wicWipb) 7^ (1, !)■ A. decoding error is committed at decoder 2 when (w2c, '2'2pa, '2'2ph) 7^ 
(1,1,1). 



B. Encoding Error 

The probability that the encoding fails can be bounded as: 



P [encoding NOT successful] = P 



n n 

I \bo = l I l6i = l I I&2 = 1 

(C/2^(1), (1, 1), U(i{l, 1, bo), [/i^,(l, 1, bo, 1, 61), U^p,{l, 1, 60, 1, 62)) i Tf (pc/..,x„t/..,t/,,..c/.,J 
where 

2««'lc 2"'''ipi' 2"-"2pb 

^ = ^ ^ X! ^"^boMM 
bo = l 61 = 1 62 = 1 

and 

^fto,6i,fc2 = l{(c/«Jl),X«(l,l),C/«Jl,l,bo),C/f,,(l,l,bo,l,bi),C/2"p.(l^l^boa,b2))eT~(p„^^_^^ 

where l{xeA} = 1 if a; G A and zero otherwise. 

The mean value of K (neglecting all terms that depend on e and that eventually go to zero) is: 

2™-^ic 2"-^'ip'' 2"''2pi' 

E E ^[^''o,b„b2 = i]-2^(«-+«ip^+«ip-^) 

ho = l 61 = 1 62 = 1 

with 

2-NA ^ p[KbM2 - 1] = ^^[^6o,fti,62] 

= P[{Uii{l), (1, 1), [/i^(l, 1, bo), C/i^fc(l, 1, 60, 1, 61), Ug,il, I, bo, 1, 62)) e Tf (pc7,^.x2,c/,.,c/i,„c/2,J] 

= E PUla\U2aPU2pt\U2c,Ula,X2PUipt\U2a.Ul^ 

«.<6.<.)6Tf (Pl/2c.X2.t/lc,^lp.,^2pJ<.=^2") 
2-^[-f('^l<=;^2|;72c) + /(;7lp6;X2,(72p6|;7le,;72c)] 

The variance of A' (neglecting all terms that depend on e and that eventually go to zero) is: 

2™-"ic 2"'''ip'' 2"''2pb 2™-^i<: 2"-^'ip'' 2"-^2pb 

Var[i^] = E E E E E E (^[^bo,fc.,b2 = l-^bi.fc;.^ = 1] - ^[^6„,bi,b2 = l]^[^b^.fc;,^ - 1]) 

bo = l hi = l 62 = 1 b'„ = l b[ = l b'^ = l 

E (^[^fc0,6l,fc2 = l,i^fco,bi,^ - 1] - ^^[^60,61,62 - l]P[Kb„,b',.,b'2 = 1]) 

feo=''oX''lib2,fci,fc2) 

^ E Pi^boMM = l,-f^fco,fci,^ = 1] 

feo,(i>l,''2,fci,b2) 
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because when 60 7^ ^o, that is, Uf^{...,bo) and C/]'^(..., &q) are independent, the RVs i^6o, 61,62 ™d ^6^,6;, 6^ are independent 
and they do not contribute to the summation. We thus can focus only on the case bo = bg. We can write: 

Var[i^] < J2 P[Kb,MM = 1] 

V ' 

=E[K] 

+ X] P[KboMM = l]-P[-f^f)o,fci,b2 = l|^f'o,&i,b2 = 1] 

5o, 61 =6^, 627^62 

^ V ' 

+ X] -^[-^^60,61, 62 = l]-P[-^6o,6i,62 = l|-^6o,6i,62 = 1] 

60, biT^bj, 62=62 

^ V ' 

=B[K]2''<«ip»-''' 

+ -P[-f'^6o,6i,62 = l]-P[-f'^6o,6i,6^ = l|-K'6o,6i,62 = 1] 

60, 61 5^6'i, 62/62 

^ ^ 



and 



= P[{UZ{1), X^{1, 1), 1, 60), C/i^6(l, 1, &0, 1, &l), C/2p6(l, 1, ^'0, 1, ^2)) e (pc/,^,;f„c/,^,c/,^„£/,^J| 

(C/2^(l), (1, 1), U^,{1, 1, &o), f/i^6(l, 1, bo, 1, 61), 1/2^^6(1, 1. bo, 1, 62)) e Tf (pt,, 

!/2„6)] 

= X PC/2p6|i^2c,C/lc,^2 



ST'." (Pr /2c . X2 , L-i„ r/i^, . L-2P, K -«fc ."j;^) 

2-JV/({72p6;£/lp(,|£/2c,f/lc,^2) 



and 

2-jvc ^ P[Ki,g^},[,b2 = 11^60,61,62 = 1] 

= P[{U^,{l),Xi'{l,l),U^,{l,l,bo),U,%b{l,'^,boA,b[)Mgbihl,boAM))& 

(UZil), X^{1, 1), C/i^(l, 1, bo), C/i^(,(l, 1, 60, 1, foi), C/a'^bll, 1, ^'0, 1, ^'2)) G I^J^(P[/2c,X2,£/i.,[/ip„[/2pJ] 

= X] P;7lp6|f/2c,f/lc 

"J^,i,eT/'(p[72c,X2,l/lc,C/lp6>C/2p6l"X.^^ ."fc^^fc) 
^ 2-NI{Ulpi;X2,U2pb\Ulc,U2c) ^ 

and 

2-Jvr> ^ p[/;fj^^ j^,^ ^ = l\Kbo,bi,b2 = 1] 

= P[{UZ{1),X^{1, 1), C/i^(l, 1, 6o), U^pbih 1, &0, 1, C/2p6(l, 1, ^0, 1, &2)) e 7f (py,,,;c2,C/ie,C/ip.,C/2pJI 

(1, 1), U.'iil, 1, 6o), C/i'^feCl, 1, ^'0, 1, bi), Ugbi^, 1, bo, 1, 62)) G Tf (pt,, 

U2pb)] 

= X] Pi/2p6|i72c,i7lc,^2f'C/lp6|(/2c,i7lc 

«p,'<,)^T^''(PU2..X2.U,^.U,p,.U2p,{uZ,^¥,U^J 
^ 2-^-'"('^lpi';'^2'^2pl,|!7lc,(72c) _ 2"''^'^ 

Hence, we can bound P[K = 0] as: 

-, , oN(Kpb-C) , 2^(«2pi,-B) _L 2^(«ipi.+«2pi,-C) 

< P\K = 01 < ^^^^ — 

and P[ii' = 0] -)■ if 

^'ic + Kb + Kb - HUic, X2\u2c) -c>o 

-R'lc + R'lpb + -R2P6 ~ I{Uic; X2\U2c) - C - {R'2pb - B) > 

R'lc + R'lpb + -R2p6 - HUic, -^^21^^20) - C - {R'lpb - C) > 

K + Kb + -^2p6 ~ I{Ulc; -^2|C^2c) - C - {R'lpb + R'2pb - C) > 
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TABLE V 
Error events at decoder 2. 



Event 


W2c 


(wic, hi) 


W2pa 


U>2pb 


mi* 


E2.1 


X 










E2,2a 


1 


X 


X 






-£'2,26 


1 


1 


X 






E2,-ia 


1 


X 


1 


X 


PY2|C72c,Jf2 


-£'2,36 


1 


1 


1 


X 


-Pf2|C/2c,C/1c,-^2 



that is, if 

P'lc + P'lpb + -^2p6 > I{Uic\ ^2|t^2c) + I{Ulpb\ X2, U2pb\Ulc, U2c) 

= liUic, Uipb-, X2\U2c) + HUlpb-, U2pb\Ulc, U2c, X2) 
R'lc + R'lpb > HUlc, X2\U2c) + HUlpb; X2\Uic, U2c) 
^I{Uia,Uipb;X2\U2c) 

R[, + R^pb> l{Uic;X2\U2c), 
R[,> I{Uic;X2\U2c) 
as in ( ll2al )-( fT2cl ). because the second to last equation is redundant. 

C. Decoding Errors at decoder 2 

If decoder 2 decodes {w2c,W2pa,W2pb) 7^ (1, Ij 1)> then an error is committed. The probabiHty of error at decoder 2 is 
bounded as; 

P[error 2 1 encoding successful] < -P[-E'2.i], 

iG{l,2a,26,3a,36} 

where i?2,i, i € {1, 2a, 2b, 3a, 36}, are the error events defined in Table IVl In Table FVl an "X" means that the corresponding 
message is in error (when the header of the column contains two indices, an "X" indicates that at least one of the two indexes 
is wrong), a "1" means that the corresponding message is correct, while the dots indicates that "it does not matter 

whether the corresponding message is correct or not; in this case the most restrictive case is when the message is actually 
wrong." The last column of Table IVl specifies the Py2\* to be used in ( fTSl) . 
We have that P [error 2 [encoding successful] — >■ when — > cx) if: 

• When the event i?2,i occurs we have 'W2c 7^ 1- In this case the received is independent of the transmitted sequences. 
This follows from the fact that the codewords C/^ are generated in an iid fashion and all the other codewords are generated 
independently conditioned on U2c- Hence, when decoder 2 finds a wrong J7^, all the decoded codewords are independent 
of the transmitted ones. We can bound the error probability of E2.1 as: 



P[E2,l] = P 



u 



{yf, U2c{w2c),U^^{wic, W2c, bQ),X2{w2c, W2pa) , U2pb{w2c, W2pa, Wlc, ^0, W2pb, 62)) S {PY2.U2a.Ula.X2,U2pb)] 

< 2N{R2a + R2p.+Rla+R[^+R2j,b + R'2pb) ^ P2|*|* = 

ivS',^Z,U^.^^^,ug,)eT«{p-,,,U2,,U,,,X2.U2p,) 
<; 2^(-R2c + -R2pa + -R.lc+-R.'ie + fl2pi. + -R-2pi,--f2|*U = 0) 

for p2|* given in ( fTSl l and I2\^, given in ( fTTb . Hence P[p2.i] as iV ^ cx) if (I12dl ) is satisfied. 

When the event i?2,2 occurs, i.e., either i?2,2a or -E'2.26, we have W2c = 1 but W2pa 7^ 1- Whether wic is correct or not, 
it does not matter since decoder 2 is not interested in wic. However we need to consider whether the pair (wicbo) is 
equal to the transmitted one or not because this affects the way the joint probability among all involved RVs factorizes. 
We have: 

- Case p2,2a: either wic 7^ 1 or 60 7^ bo- In this case, conditioned on the (correct) decoded sequence U^^, the output 
is independent of the (wrong) decoded sequences t//^, X^ and also of (because [/^^ is superimposed to 
the wrong pair {U{^^, X^)). It is easy to see that the most stringent error event is when both wic ^ 1 and bo ^ bo. 
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Thus we have 



P[E2,2a] =P U 
{Yf, Uii{l), U^^{l,Wic, &o), (1, W2pa),U2pb{^, W2pa,Wic, bo, W2pb, ^2)) G (pYs ,(72o,C/lo,X2 ,C/2pb ) ] 

< 2^(«2pa+-Rlc+-R'ie + fl2pb + fl2p6--f2|* U = !72c) 



for p2|* given in ( fTSI l and given in (fTTb . Hence P[i?2,2a] — > as iV — cx) if il2ei is satisfied. 

60 = bo- In this case, ( 
is independent of the (wrong) decoded sequences {X2 , C^2pf))- Thus we have 



- Case E2.2b- both wic — 1 and bo = bo- In this case, conditioned on the (correct) decoded (J7^, U^^), the output 



P[E2^2b] =P U 

'5'2pa5^1,tt'2p6,''2 

(r2^,t/2'^(l),;7i^(l,l,So),X2^(l>2pa),f/2'^fc(l>2pa,l,6o,^?2p6,62)) G (Pl2,C/2e,C/l..X2,[/2p.)] 
<2^(«2p„ + «2p. + «ip.) Y: P2\.l=iU2...U..) 

fe" ,< ,< ) 6^." (PV2 , t.2c. C'lc,:^2 . £/2p6 ) 

^ 2-'^(-'^2pa + -R2p6 + -R2pb^-'^2|* \* = (U2a,V i^)) 

for P21* given in (fT6l l and /2|^ given in (fTTI ). Hence P[i?2,2fc] — > as — > 00 if ( I12fb is satisfied. 
When the event i?2,3 occurs, i.e., either i?2,3a or i?2,3h, we have W2c — ^,W2pa = 1 but W2ph 7^ 1- Again, whether wic is 
correct or not, it does not matter since decoder 2 is not interested in wic- However we need to consider whether the pair 
{wic, bo) is equal to the transmitted one or not because this affects the way the joint probabiHty among all involved RVs 
factorizes. The analysis proceeds as for the event i?2,2- 
We have: 

- Case i?2,3a: either wic 7^ 1 or 60 ^ bo- In this case, conditioned on the (correct) decoded sequences {U^^, X^^), the 
output Y^ is independent of the (wrong) decoded sequences {U(^^, t^2c^2p6)- ^^^Y ^'^ '^^at the most stringent 
error event is when both wic 7^ 1 and bo ^bo- Thus we have 



P[E2,3a] = P 



u 



wic^l,bo^bo,W2pb,b2 



{Yi' ,Ul{l),U^^{l,w,,M),X^{l,l),U^^,{l^,w,,M,w^^^^ 

< 2^(«-+«'-+«^^^+^i.^) J2 P2\.U=iU2.,X2) 

ivS',uZ,U^,,^^,ul,)&T«{p^^,U2,.U,,.X2.U2pb) 

< 2^(«2pa + fllc+i?i,+i?.2pb+fl.2pb--f2|*l* = ([/2,,X2)) 

for p2|* given in (fTSI l and /2I* given in ( fTTI ). Hence P[-E2,3a] — as A'^ cxo if ( 12gi is satisfied. 



- Case E2.3b' both wic — 1 and bo = bo- In this case, conditioned on the (correct) decoded sequences (C^2c:^2 1 t^ic)' 
the output Y^ is independent of the (wrong) decoded sequence t/^f,. However, since (J/ici^i^; ^ic) the triplet 
that passed the encoding binning step, they are jointly typical. Hence, in this case we cannot use the factorization in 
P2|* given in (fTST l. but we need to replace Puia\U2a C3 with Puia\U2a,X2- Thus we have 



P[E2,3b] = P 



u 



«'2p6,02 



(^2^^, Uiiil), 1, 60), ^2^(1, 1), (72^^,(1, 1, 1, 60, W2pb, 62)) e iPY2,U2.,U,.,X2,U2pb)] 

< 2^(^^'"'+-^2pb) PU2cPX2\U2cPUl,\U2a,X2PU2pb\U2a.Ul,.X2PY2\Ul,,U2a,X2 

fe2",<,<.^2"><b)6T/'(pV2.L/2.,C/l.,X2.C/2pb) 

< 2^(-R2pb+-R.2pi,--f(y2;;72pb|(7lc,(72c,X2)) 



Hence P[-E2.36] — as 00 if (I12hl i is satisfied. 
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TABLE VI 
Error events at decoder 1. 



Event 


W2c 


(Wlc, f)l) 








X 








El,2 


1 


X 




PYl\U2a 


El,3 


1 


1 


X 


Pyi\U2^,Ui^ 



D. Decoding Errors at Decoder 1 

The probability of error at decoder 1 is bounded as: 

3 

P[error 1| encoding successful] < P[Ei^i], 

where P[Ei,i] is the error event defined in Table IVTl The meaning of the symbols in Table IVll is as for Table IV] We have that 
P [error 1| encoding successful] — s> when — s> cxd if: 

• When the event Ei^i occurs we have W2c / 1. In this case the received Y-^ is independent of the transmitted sequences. 
We can bound the error probability of i?i i as: 



P[E: 



u 

«'2c#l,li'lc,li'lpb,6o,&l 

{Y^^ ,U2c{w2c),Ujl{wic,W2c,bQ),U^pl,{w2c,W^^ € (pYi ,C/2c ,C/lc ,(7ip6 )] 

< 2^(-R2c+i?.2pa+fllc+fl/i, + fl2pb + fl2p6--fl|*U=») 

for given in ( fTSI l and I^^ given in ( l20b . Hence P[i?i.i] as iV — )■ oo if ( I12il i is satisfied. 

When the event Ei^2 occurs, either wic ^ 1, bo ^ bo or both. In this case, conditioned on the (correct) decoded sequence 
J7^, the output is independent of the (wrong) decoded sequences U{^^ and U^)^ . It is easy to see that the most 
stringent error event is when both wic ^ 1 and 5o ^ 6o- Thus we have 



P{El,2\ = P 



u 



W/1 ~ . ^ ttN 



{Y,'',Uiiil),U,'iil,wic,bo),U,''p,{l,wi,,bo,wipb,bi)) S Tf {pyuU2.,u,.,u,J] 



for given in ( fT9] l and given in ( l20b . Hence P[i?i.2] — as A'^ — > oo if (12j i is satisfied. 
« When the event Pi. 3 occurs, either vuipb ^ 1, bi ^ bi or both. In this case, conditioned on the (correct) decoded sequence 
C/;^ and U^), the output Y^ is independent of the (wrong) decoded sequences U^i^. It is easy to see that the most 
stringent error event is when both wipi, ^ 1 and bi ^b\. Thus we have 



P[Pl,3] = P U 

.Cjipb#l,fci#fci 

(yi^,[/2^(l),[/l^(l,l,6o),t^l'^fc(l,l,6o>l;,6,&l)) e Tf (pn,C/2.,C/,.,[/,pJ] 
^ 2^{Rla+R'^^■^Rlvb+R!x■l,b-l^^* = u2^.Ul^) 



for given in ( fT9] l and given in ( l20b . Hence P[Pi_3] — > as iV — > 00 if (I12kl) is satisfied. 
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E. Proof of Lemma \V.3\ 

An encoding error is committed if we cannot find a bo in the first step or if, upon finding the correct bo in the first encoding 
step, we cannot find the correct (61,62) in the second step. Let E^.o the probabiUty of the first event and £'e,i2 of the latter, 
than: 

P[encoding NOT successful] < P[Ee„o] + P[Ee, 12] Elo] 

where 

P[E,,o] -PinCt (t^2^(l),^2^(l,l),C^i^(l,l,&o)) (Pa..,x.,r/.J] 

= (1 - P[(C/2^(1), (1, 1), C/,^(l, 1, 60)) ^ ^,^(pc/..,x..c/.J]^"'^^ 
Using standard typicality arguments we have 

Now we can write 

P[Ee,o] < (1 - (1 - e)2^(^(^i-^^l^-=)+^))2""'^= 

< exp (1 - (1 - £)2^(«ie-^(t^ic;^2|c/2c)+5))^ 

so that P[Ee,o] ^ 1 when iV ^ if ( I21al ) is satisfied. 
Now the error event £"£,12 can be divided in three distinct error events: 

. £^£,21 a- it is not possible to find 61 such that {Uii, Xi' ,U('^,U^t,) G T^^ {pu2.,X2,Ui,,u^^,), 
. Ee,2i b- it is not possible to find 62 such that {U^,,X^ , U^^, Ug^) G T^"" {pu2.,x,,u,.,u,J- 

• Ee,2i c Given that we can find bi and 62 satisfy the first two equations, we cannot find a couple (61,62) such that 

{U2c,X^, C/i^, U^p^, U^p^) G {PU2a,X2,Ula,Ulpb,U2r,b)- 

We now establish the rate bounds that guarantee that the probability of error of each of these events goes to zero. 
For i?e,2i a we have: 

P[£e,21a] =il-P[{ui^Ai),Xi^{ia),UrAia,bo),U,^p,ilA,boA,b,))iTf{pu2^^X^^^^^^ 

where 

^[(C/iKl), ^2^(1, 1), C/i^(l, 1, bo), C/i^,(l, 1, 60, 1, 61)) ^ T,^{pu2.,X2..u....u.J] > (1 - e)2~^(nx2;U.^.\U2M-^s)_ 

As for Eefl, this impHes that P[Ee,2i a] 1 when iV — if illhh is satisfied. 

For £'e,2i b, we have that the probability of this event goes to one for large N given that (U2C, X2,Uic) appear to be 
generated according to the distribution Pu2a,X2,Uia U2pb is generated according to Pu2pb\U2c,X2,Ui^- 

For i?e,2i c we have: 

P[Ee,21 c] = (1 - P[([/2^(l), (1, 1), [/i^(l, 1, 60), f/i^,(l, 1, 60, 1, 61), C/i^,(l, 1, 60, 1, 62)) 

f \PU2a,X2,Ula,Ulpi,U2j.l,)\) 

where 

P[(C/2^(1), ^2^(1, 1), C/i^(l, 1, 60), C/i^fc(l, 1, 60, 1, 61), t/i^,(l, 1, 60, 1, 62)) i T^{pu2.,X2,u,..u,,,,U2j]) < 2'"+'- 
This implies that P[i?e,2i c] ^ 1 when iV ^ if ( I21cl ) is satisfied. 

E Containment of [24. Thm. 1 ] in 'Rrtd of Section IV7-AI 

We refer to the region in 1241 Thm. 1] as 'Rdmt for brevity. We show this inclusion of 'JIdmt in ^rtd with the following 
steps: 

• We enlarge the region 'JIdmt by removing some rate constraints. 

• We further enlarge the region by enlarging the set of possible input distributions. This allows us to remove the Vii and Q 
from the inner bound. We refer to this region as 3?x3A/t since is enlarges the original achievable region. 



• We make a correspondence between the RVs and corresponding rates of J^^^jx ^nd OIrtd- 

out 



• We choose a particular subset of JIbtd, ^^rtd^ ^^i" which we can more easily show JIdmt ^ J^^dIj-y C J^^rtd — ^RTD, 
since J^'dIit ^'^'^ -^^rtd have identical input distribution decompositions and similar rate bound equations. 
Enlarge the region JIdmt 

We first enlarge the rate region of ll24l Thm. 1], JIdmt by removing a number of constraints (specifically, we remove equations 
(2.6, 2.8, 2.10, 2.13, 2.14, 2.16 2.17) of |l24] Thm. 1]). Also, following the line of thoughts in |l33l Appendix D] it is possible 
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to show that without loss of generality we can set Xi to be a deterministic function of Vu and V12, allowing us insert Xi 



next to Vii, V12. With these consideration we can enlarge the original region and define ^'jjIjt follows. 

= IiV2i;Xi,Vn,Vi2\W) (44a) 

i?22 = /(V^22;^i,T^ii,T^i2|W^) (44b) 

Rii < I{YuVi2,V2i;Vn\W) (44c) 

R21 + R21 < I{Yi,Xi,Vn,Vi2;V2i\W) (44d) 

+ i?2i + i?2i < I{Yi,Vi2;Vn,V2i\W)+I{Vn]V2i\W) (44e) 

+ i?2i + i?2i + i?i2 < I(Yi;Xi,Vn,Vi2,V2i\W)+I{XuVn,Vi2;V2i\W) (44f) 

i?22 + i?22 < /(>^2, 1^2, ^21; 1^2211^) (44g) 

R22 + R22 + R2I + R21 < I(Y2,Vi2;V22,V2l\W) + I{V22;V2l\W) (44h) 

i?22 + i?22 + ^21 + i?21 + ^12 < /(i"2 ; ^22 , ^21 , '^^12 | VF) + /(\^22 , ^21 ; ^12 | W^) (44i) 

taken over the union of distributions 

PwPViiPVi2Pxi\Vii,Vi2Pv2i\ViiVi2Pv22\Viiyi2Px2\Vii,Vi2,V2i.y22- (45) 



Using the factorization of the auxiliary RVs in ll24l Thm. 1], we may insert Xi next to Vu in equation ( I44f1 l. 
For equation ( I44cl i: 

< /(ri,yi2,y2i;Vii|M^) 

= /(Fi , ^21 ; Vii 1^12 , + /(1/i2 ; 14i I W^) 

= /(yi,y2i;i^ii|i42,T^) 

= /(yi,y2i;^i,i4i|'t^i2,i4^) 

= I{Yi ■,Xi,Vn\Vi2, V21 , W) + I{V2i ; Xi , Vii 11/12 , W^). 

For equation ( I44el i we have: 

+ i?21 + i?21 < I{Yl,Vi2;ViuV2l\W)+I{Vii-V2l\W) 

= /(Fi; Fii, ^211^12, VK) + /(V12; Fn, t/2i|iy) + /(Fii; F21IW) 

= I{Yi- Vii,V2i\Vi2, W) + I{Vi2:V2i\Vii,W) + I{Vii-V2i\W) 

= /(Yi ; Vii , V21 1^12 , W) + /(Vii , F12 ; V2i\W) 

= /(Yi ; Xi , t/n , t/21 1 V12 , M/) + /(Xi , , V12 ; 1^2i | W^) 



The original region is thus equivalent to 

R'2^ = I{V2uX^,Vii,Vi2\W) (46a) 

i?^2 - I{V22\X^,Vii,Vi2\W) (46b) 

< I{YuXi,Vii\Vi2,V2i\W)+I{V2i;X^\Vi2.W) (46c) 

R21 + R21 < I{Yi,Xi,Vn,Vi2;V2i\W) (46d) 

+ i?2i + i?2i < /(Yi;Xi,yn,y2i|l^i2,W^)+/(^i;l/2i|W^) (46e) 

i?ii + i?2i + i?2i + ^12 < I{Yi;Xi,Vn,V2i,Vi2\W)+I{Xi,Vn,Vi2;V2i\W) (46f) 

R22 + R22 < I(Y2,Vi2,V2i;V22\W) (46g) 

R22 + R'22 + R2I + R2I < nY2,Vi2;V22,V2l\W) + I{V22;V2l\W) (46h) 

i?22 + i?22 + ^21 + i?21 + i?12 < /(Y2; 1/22, 1^21, 1^12|Vt^)+/(1^22, 1^21; 1^12|W/) (46i) 

union over all distributions that factor as in ( l45b . 
Enlarge the class of input distribution and eliminate Vn and W 



Now increase the set of possible input distributions of equation |45] bv letting Vn have any joint distribution with Vi2- This is 
done by substituting pv^ with Pvii|Vi2 '^he expression of the input distribution. With this substitution we have: 

PwPyii|yi2PVi2Pxi|yii,Vi2Py2i|-^i,ViiVi2Py22|Xi,yii,yi2Px2|Jfi,yii,yi2,V2i,V22 
= PwPyi2Pyii,Xi|yi2Py2i|Xi,yiiyi2Py22|Xi,yii,yi2Px2|Xi,yii,yi2,y2i,y22 

= PWPVl2PX{\Vl2PV2l\X{ ,Vl2PV22\X{ ,Vl2PX2 | ( ,yi2 ,y2i ,y22 

with X[ = {Xi,Vii). Since V12 is decoded at both decoders, the time sharing random W may be incorporated with V12 
without loss of generality and thus can be dropped. The region described in ( |46] ) is convex and thus time sharing is not needed. 
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TABLE Vn 
Assignment of RVs of Appendix[f] 



RV, rate of Theorem |V. 1| 


RV, rate of (24j Thm. 1] 


Comments 


U2c,R2c 


Vl2,-Rl2 


TX 2 RX 1, RX 2 


Ulc, Ric 


V21,-R21 


TX 1 ^ RX 1, RX 2 


Uipb, Rlpb 


V22 , -R22 


TX 1 ^ RX 1 


X2 , R2pa 


X[,Rii 


TX 2 RX 2 


U2pb = D, R'2pb = 




TX 1 RX 2 


R'lc = I{Ulc;X2\U2c) 


L21-R21 = /(V2i;Vii,Vi2) 


Binning rate 


R[pb = HUipb;X2\Ui„U2c) 


-f'22 — -R22 = 1(^22', Vil, V12) 


Binning rate 


Xi 


X2 





With these simphfications, the region ^i)mx 1^°^ defined as 









R'21 




I{V2i;X{,Vi2) 










R'22 




I{V22;X[,Vi2) 










Rii 


< 


I{Yi;X[\Vi2,V2i)^ 


-I{V2i;Xi\Vi2) 






R2I - 


^R'21 


< 


I{YuX[,Vi2;V2i) 






Rii - 


Ki?21 - 


^R'21 


< 


I{Yi; X[,V2i\Vi2)^ 


-I{Xi-V2i) 


Rii - 


- R21 - 


^R2i- 


^Rl2 


< 


I{Yi;X[,V2i,Vi2)- 


^ I{X[,Vi2]V2l) 






R22 - 


'r R'22 


< 


I{Y2,Vi2,V2i;V22) 




R22 - 


^ R'22 - 


Ki?21 - 


^R'21 


< 


I{Y2,Vi2;V22,V2l)- 


f /(V^22;V^2l) 


R22 + R'22 " 


Ki?21- 


^^21- 


^Rl2 


< 


I{Y2;V22,V21,Vi2) - 


f /(V^22,V^2i;V^12) 



taken over the union of all distributions 



(47a) 
(47b) 
(47c) 
(47d) 
(47e) 
(47f) 
(47g) 
(47h) 
(47i) 



PViaPx; IViaPVai [XJ , |x; ,^12^X2 \X[ ,^12,^21,^22 ■ 

Correspondence between the random variables and rates. When referring to Il24l please note that the index of the primary 
and cognitive user are reversed with respect to our notation (i.e 1^-2 and vice-versa). Consider the correspondences between 
the variables of ll24l Thm. 1] and those of Theorem IV. II in Table [VTI] to obtain the region defined as the set of rate 

pairs satisfying 









-R'lc 




I{Uic;X2,U2c) 




(48a) 








Rlpb 




I{Uipb; X2, U2c) 




(48b) 


R2pa 


+ Ric 


+ R'lc 


+ R2C 


< 


I{Y2;Ui„U2c,X2)^ 


h/(X2,t/2c;C/lc) 


(48c) 




R2pa 


+ Ric 


+ R'ic 


< 


I{Y2;X2,Uic\U2c)^ 


-/(^2;C/ic) 


(48d) 






Ric 


+ 


< 


I{Y2,X2,U2c;Uic) 




(48e) 








R2pa 


< 


I{Y2:X2\U2c,Uic)^ 


-I{Ui,,;X2\U2c) 


(48f) 


Rlpb + R'lpb 


+ Ric 


+ R'lc 


+ R2c 


< 


I{Yi;Uipb,Uic,U2c) 


+ I{Uipb,Uic;U2c) 


(48g) 


Ric + 


Rlpb ~ 


h i?'i, 4 


^ R'lpb 


< 


I{Yi,U2c;Uipb,Uic) 


+ I{Uipb; C/ic) 


(48h) 






-Rlpb 4 


" R'lpb 


< 


I{Yi,U2c,Uic;Uipb) 




(48i) 



taken over the union of all distributions 

PC/2cPX2|(72cP(7lc|X2PC/ip,|X2PXi|X2,(7lc,C/lp6- (49) 

Next, we using the correspondences of the table and restrict the fully general input distribution of Theorem IV. II to match 
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thp mnrp pon^trainpH fa f tori yation of d4.Qil ohtai ni n 




ion 




*iati *ifvin (T 




R'lc 




I{Uic;X2\U2c) 


(50a) 




Ripb 




I{X2;Uic,Uipb\U2c) 


(50b) 


R2c + Rlc + R2pa - 


^R'lc 


< 


I{Y2;U2c,Uic,X2) + I(Uic;X2\U2c) 


(50c) 


R2pa + -Rlc - 


1- -Kic 


< 


I{Y2;Uic,X2\U2c) + I{Uic;X2\U2c) 


(50d) 


-Rlc - 


1- -Kic 


< 


I{Y2;Uic\U2c,X2) + I{Uic;X2\U2c) 


(50e) 




R2pa 


< 


I{Y2;X2\U2c,Uic) + I{Uic;X2\U2c) 


(50f) 


-Ripb + R'lpb + Rlc + R'lc " 


^R2c 


< 


nYi;U2c,Uic,Uipb) 


(50g) 


Rlc + Ripb + R'lc + 


R'lpb 


< 


I{Yi;Uic,Uipb\U2c) 


(50h) 


-Rlpb + 


Ripb 


< 


I{Yi;Uipb\U2c,Uic) 


(50i) 



union of all distributions that factor as 

PU2c,X2PUl^\X2PUipi\X2PXi\X2,Ul^,Uipb- 

Equation-by-equation comparison. We now show that ^'j^mt — ■^'rtd fixing an input distribution (which are the same 
for these two regions) and comparing the rate regions equation by equation. We refer to the equation numbers directly, and 
look at the difference between the corresponding equations in the two new regions. 

• ( I50cb -( l50al l vs ( I48cl l-( l48al i: Noting the cancelation / interplay between the binning rates, we see that 



50cl) - (|50i) - ((gSdli - (I48al) ) 



. (I50dli-(l50al) vs. (I48dli-(l48all: 



0. 



(I50al) ) - (gSdl) - (I48al i) 

^-IiX2;Uic) + IiUic;X2,U2c) 

= I{U2c\Uic\X2) 

= 



( I50al l vs. ( I48el l-( l48al ): again noting the cancelations, 

C dSOel) - dSOill) - f (l48el) 

dSOft vs. (|48B: 



gSaJl) = 



(|50g]l-(|50bli vs. (|48gll-(l48bli-(l48al) 

(( |50gl ) - (I50b]i) - ((|48g]i - gSB 



3) 

-/(X2;f/lc,[/lp6|C/2c) 

-/(C/ipb, C/ic; C/2c) + /(C/ie; C/2C, X2) + /(C/ipfc; [/2c, ^2) 
= -/(C/ipb, ;7ie; X2, [/2c) + /([/ic; [/2c, X2) + I{Uipb; [/2c, ^2) 
= ~I{Uipb; X2, [/2c) - /([/ic; X2, U2c\Uipb) + /([/ic; [/2c, ^2) + /([^ipb; [/2c, ^2) 

= -/([/lc;X2,[/2c|[/lp6) +/([/lc;[/2c,^2) 

= -H{Uic\Uipb) + H{Uic\X2, U2c, Uipb) + i[([/ic) - H{Uic\X2,U2c) 
- /([/ic; [/ipb) > 

where we have used the fact that Uic and Uipb are conditionally independent given ([/2c, -'^^2)- 
dSOiil l - dSObll vs. (1483 - dSbl) - digal i: 



f dSOiil) - (|50bll) - ((1483 - dSB - (l48al l) 

- -/(X2; [/ic, [/ip6|[/2c) - /([/2c; [/ic, [/ipb) + /([/ipb; [/2c, ^2) - /([/ipb; [/ic) + /([/ic; ^2, [/2c) 

= -I{X2, [/2c; [/ic, [/ipb) + /([/ipb; [/2c, ^2) - /([/ipb; [/ic) + /([/ic; ^2, [/2c) 

= -I{X2, [/2c; [/ipb) - /([/ic; ^2, [/2c|[/ip6) + /([/ipb; [/2c, ^2) - /([/ipb; [/ic) + /([/ic; ^2, [/2c) 

= -/([/ic; ^2, [/2c, [/ipb) + /([/ic; ^2, [/2c) 

= -/([/ic; X2, [/2c) - /([/ic; [/ipfc|^2, [/2c) + /([/ic; ^2, [/2c) 
= 



where we have used the fact that Uic and Uipb are conditionally independent given {U2ctX2 
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= —I{Ulpb', X2, U2c,Uic) + liUipb] U2c,X2) 
= -I{Uipb;Uic\U2c,X2) 
^ 



G. Containment of l[26\ Thm. 2 ] in 'Rrtd of Section [G] 

The independently derived region in ifTOl Thm. 2] uses a similar encoding structure as that of ^rtd with two exceptions: 
a) the binning is done sequentially rather than jointly as in 'Rrtd leading to binning constraints (43)-(45) in flO', Thm. 2] as 
opposed to (I12ab -( fl"2cl i in Thm l V. 1 1 Notable is that both schemes have adopted a Marton-like binning scheme at the cognitive 
transmitter, as first introduced in the context of the CIFC in ifTOl . b) While the cognitive messages are rate-split in identical 
fashions, the primary message is split into 2 parts in ITOl Thm. 2] = Rn + Riq, note the reversal of indices) while we 
explicitly split the primary message into three parts R2 = R2C + R2pa + R2pb- We show that the region of ifTOl Thm.2], denoted 
as Jicc ^ "Rrtd in two steps: 

• We first show that we may WLOG set Uu = in [TO', Thm.2], creating a new region Rqq. 

• We next make a correspondence between our RVs and those of |10 Thm.2] and obtain identical regions. 
We note that the primary and cognitive indices are permuted in |10|. 

We first show that Uu in ifTOl Thm. 2] may be dropped WLOG. Consider the region Jlcc of [TOl Thm. 2], defined as the 



union over all distributions PUw,Uii,Vii,V2o,V22,Xi,X2PYi,Y2\Xi,X2 of all rate tuples satisfying: 

Ri < IiYi;Vii,Uii,V2o,Uio) (51a) 

R2 < I{Y2;V2o,V22\Uio) - I{V22,V2o;Un\Uio) (51b) 

R1+R2 < I{Yi; Vn, Un\V2o, t/10) + I{Y2; V22, V20, Uw) - /(I/22; f/11, IF20, C/10) (51c) 

R1+R2 < I{Y,;Vn,Un,V2o, Uw) + /(Fs; V^22|1^20, Uw) - /(I/22; Uii,Vii\V2o, Uw) (51d) 
2R2 + Ri < I{Yi-Vn,Uii,V2o\Uio) + I{Y2; V22|1^2o, C/10) + I{Y2; V20, V22, C/10) 

-/(V22; Un,Vn\V2o, C/10) - /(V22, V20; C/ii|C/io) (51e) 



Now let 'JI'qq be the region obtained by setting [/{j^ = and = (Vii, C/n) while keeping all remaining RVs identical. 
Then IR'^p is the union over all distributions Puio,v{^:V20:V22:Xi,X2PYi,Y2\Xi,X2^ with Vd ~ (V^ii,C/ii) in 3?cc, of all rate 
tuples satisfying: 

Ri <I{Yi;Vii,Un,V2o,Uio) (52a) 

R2 < I{Y2;V2o,V22\Uio) (52b) 

R1+R2 < I{Yi;Vn,Un\V2o, C/10) + I{Y2; V22, V20, C/10) " /(1^22; Un,Vn\V2o, C/10) (52c) 
R1+R2 < I{Yi;Vn,Un,V2Q, C/10) + /(>2; 1^22|V^20, C/10) - /(V22; Un,Vn\V2Q, C/10) (52d) 
2i?2 + Ri < I{Yi;Vn,Un,V2o\Uio) + I{Y2\ ^2211^20, C/10) + I{Y2\ X^20, 1^22, C/10) 

-/(V^22;C/ii,yii 1^20, C/10) (52e) 

Comparing the two regions equation by equation, we see that 
. ^l^= (l52al) 

• ( I51bb < ( I52bl ) as this choice of RVs sets the generally positive mutual information to 
. (l5Tcll=(l52cli 

. (l5Tdli=(l52dl) 

• ( I51eb < ( I52eb as this choice of RVs sets the generally positive mutual information to 

From the previous, we may set Uu = in the region 3icc of Thm. 2] without loss of generaUty, obtaining the region 
defined in ( I52ab - ( I52eb . We show that '^'qq may be obtained from the region 'JIrtd with the assigment of RVs, rates 
and binning rates in Table IVIIII 
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TABLE VIII 
Assignment of RVs of SectionIgI 



RV ratp nf Thpnrpm IV 1 1 


RV ratp nf 1941 Thm 11 
IW, idle \JL ±11111. ij 


Comments 






TY 
lA Z — 


-? K A 1, K A Z 






TX 2 - 


-> RX 2 




V20 1 ^20 


TX 1 


^ RX 1, RX 2 


U\pb, Rlpb 


V22 , ^22 


TX 1 - 


^ RX 1 


U2ph^ R2pb 




TX 1 - 


-> RX 2 




i20 — R20 






Kpb 


L22 — R22 








Lii — Rii 






Xi 


X2 






X2 


Xi 







Evaluating 'JIqq defined by (I52al l - (I52el i with the above assignment, translating all RVs into the notation used here, we 
obtain the region: 



-R'lc 










> 







^Ipb ~ 


^ R'lpb 








> 


l{Ulpb'; U2pb\Ulc, Ul 


R2pb - 


1" Rlpb 








< 


I{Yi 


Ulpb\U2c, Uic) 


Rlpb - 


1" Rlpb ~ 


\- Ric - 


h R'lc 




< 


I{Yi 


Ulc, U2pb\U2c) 


Rlpb - 


1" Rlpb ~ 


hi?lc- 


^ R'lc H 




< 


I{Yi 


Ulc, U2c, U2pb) 


Rlpb - 


1" Rlpb 








< 


I{Yi 


Ulpb\U2ci Ulc) 


Rlpb ~ 


1" Rlpb ~ 


1- Ric - 


1- R'lc 




< 


I{Yi 


Ulpb, Uic\U2c) 


Rlpb - 


1" Rlpb ~ 


\- Ric - 


h R'lc H 




< 


I{Yi 


Ulpb, Ulc, U2c) 



Note that we may take binning rate equations R[^ > and R[pi, + R'2pb ^ I{Uipb\ U2pb\U2c, Uic) to be equality without loss 
of generality - the largest region will take R\^, R'lpbi R'2pb small as possible. The region 'JIrtd with i?2pa — 



R'lc 










> 







R'lc^ 


R'lpb 








> 







R'lc 4- 


Rlpb ^■ 


R'lpb 






> 


I(Uipb; U2pb\U2c, Ul 


Rlpb - 


^ Rlpb 








< 


I{Y2 


U2pb\U2c, Ulc) 


Rlpb - 


^ Rlpb ' 


f i?lc- 


yR'ic 




< 


I{Y2 


Ulc, U2pb\U2c) 


Rlpb ' 


^ Rlpb ' 


f Ric - 


H R'lc H 


hi?2c 


< 


I{Y2 


Ulc, U2c, U2pb) 


Rlpb ' 


^ Rlpb 








< 


I{Yi 


Ulpb\U2c, Ulc) 


Rlpb - 


^ Rlpb ^ 


^Ric- 


1- R'lc 




< 


HYi 


Ulpb, Uic\U2c) 


Rlpb - 


^ Rlpb ' 


f Ric - 


V R'lc H 


hi?2c 


< 


HYi 


Ulpb, Ulc, U2c) 



For i?']^^ = these two regions are identical, showing that 'Rrtd is surely no smaller than IRcc- For R'^^ > 0, 'JIrtd , the 
binning rates of the region 'JIrtd are looser than the ones in Jicc- This is probably due to the fact that the first one uses 
joint binning and latter one sequential binning. Therefore JIrtd may produce rates larger than Jicc- However, in general, no 
strict inclusion of Jicc in JiRTD has been shown. 

H. Containment of M5\ Thm. 4.1] in JiRTD of Section \VI-C\ 

In this scheme the common messages are created independently instead of having the common message from transmitter 1 
being superposed to the common message from transmitter 2. The former choice introduces more rate constraints than the 
latter and allows us to show inclusion in JiRTD- 

Again, following the argument of [33, Appendix D], we can show that without loss of generality we can take Xi and X2 
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TABLE IX 
Assignment of RVs of SectionIhI 



Kv, rdLc oi 1 ncurcm i v. 1 1 


KV, Idle oi iz^i inm. ij 


Comments 


U2c,R2c 


Ui,Ri2 


TY 
lA Z — 


-? K A 1, K A Z 


^2 ) J^2pa 


V R' 

VI 1 ^11 


TX 2 - 


-> RX 2 


Ulc, Rlc 


U2,R21 


TX 1 


^ RX 1, RX 2 


U\pb, Rlpb 


W2,R22 


TX 1 - 


^ RX 1 


U2pb, R2pb = 


Wi 


TX 1 - 


-> RX 2 


R'l. 


L20 — R20 






R'lpb 


Lii — Rii 






R'2pb 


L22 — R22 






Xi 


Xo 






X2 


Xi 







to be deterministic functions. With this consideration we can express the region of f25l Thm. 4.1] as: 







R22 


> 


I{W2;Vi,Xi\Ui,U2) 




R'll ' 


^ R'22 


> 


I(W2;WuVi,Xi\Ui,U2 




Rii - 


yR'ii 


< 


I(VuXi,Wi;Yi\Ui,U2) 


i?12 - 


\- Rii - 


^R'li 


< 


I(Ui,VuX,,Wi;Yi\U2) 


R21 - 


K i?ll - 


^R'li 


< 


I{U2,Vi,Xi,Wi;Yi\Ui) 


Ri2 + R21 " 


^Rii - 


^R'li 


< 


I{Ui,Vi,XiWi,U2;Yi) 




R22 - 


^R'22 


< 


I{W2;Y2\Ui,U2) 


R2I - 


I- R22 - 


^ R'22 


< 


I{U2,W2;Y2\Ui) 


R12 - 


- R22 - 


^ R'22 


< 


I{Ui,W2;Y2\U2) 


i?12 + R2I " 


- R22 - 


^ R'22 


< 


I(Ui,U2,W2;Y2). 



taken over the union of all distributions 

PUiPVi\UiPXi\Vi,UiPU2PWi,W2\Vi,Ui,U2PXo\Wi,W2,Vi,Ui,U2PYi,Y2\Xi,Xo 

for {R[-^^, R'22, Rii, R12, R211 R22) e K^. 

We can now eliminate one RV by noticing that 

PUiPVi\UiPXi\Vi,UiPU2PWuW2\Vi,Ui,U2PXo\Wi,W2,Vi,Ui,U2PYi,Y2\Xi,X„ 

= PUiPVuXi\UiPU2PWi,W2\Vi,Ui,Xi,U2PXo\Wi,W2,Vi,Ui,Xi,U2PYi,Y2\Xi,Xoj 

and setting V{ = [Vi,Xi], to obtain the region 







R'22 


> 


HW2;Vl\Ui,U2) 




R'll 


+ R'22 


> 


I{W2;Wi,v;\Ui,U2 




Rii 


+ R'ii 


< 


I{Vl,Wi;Yi\Ui,U2) 


R12 - 


^Rii 


+ R'ii 


< 


I{Ui,V;,Wi;Yi\U2) 


R21 - 


^Rii 


+ R'ii 


< 


I{U2,V{,Wi;Yi\Ui) 


R12 + R21 " 


^Rii 


+ R'ii 


< 


I{Ui,V{WuU2;Yi) 




R22 


+ R'22 


< 


I{W2;Y2\UuU2) 


R21 " 


^ R22 


+ R'22 


< 


I{U2,W2;Y2\Ui) 


R12 - 


^ R22 


+ R^22 


< 


I{Ui,W2]Y2\U2) 


R12 + R21 - 


^ R22 


+ R^22 


< 


I{Ui,U2,W2;Y2) 



taken over the union of all distributions of the form 

PUiPVl\UiPU2PWi,W2\Vl,Ui,U2PXo\Wi,W2. VI ,Ui,U2PYi.Y2\ V{ .. Xo ■ 

We equate the RVs in the region of ||25]| with the RVs in Theorem IV. II as in Table IIXI 



(53a) 
(53b) 
(53c) 
(53d) 
(53e) 
(53f) 
(53g) 
(53h) 
(53i) 
(53j) 



(54a) 
(54b) 
(54c) 
(54d) 
(54e) 
(54f) 
(54g) 
(54h) 
(54i) 
(54j) 
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With the substitutions of Table |IX] in the achievable rate region of ( l54l i. we obtain the region 









Rlpb 


> 


I{Uipb; X2\U2c, Uic) 


(55a) 






R'lpb ~ 


' R'2pb 


> 


I{Uipb; U2pb, X2\U2c, Uic) 


(55b) 






R2pa - 


^ R2pb 


< 


I{X2,U2pb;Y2\U2c,Uic) 


(55c) 




R2c ^ 


- R2pa - 


' R2pb 


< 


I{U2c,X2,U2pb;Y2\Uic) 


(55d) 




Ric ^ 


- R2pa - 


' R'2pb 


< 


I(Uic,X2,U2pb;Y2\U2c) 


(55e) 






- R2pa - 


' R'2pb 


< 


I(U2c,X2,Uic,Uipb;Y2) 


(55f) 






Rlpb ' 


^ R'lpb 


< 


nUipb;Yi\U2c,Uic) 


(55g) 




Ric - 


f Rlpb ~ 


" R'lpb 


< 


I{Uic,Uipb;Yi\U2c) 


(55h) 




R2c - 




h -R'lpb 


< 


I{U2c,Uipb;Yi\Uic) 


(55i) 


Ric - 


f Ric - 


f Rlpb - 


h -R'lpf, 


< 


I{U2c,Uic,Uipb;Yi). 


(55j) 



taken over the union of all distributions of the form 

Set i?2p6 = and ~ I{Uic; X2\U2c) in the achievable scheme of Theorem IV. 1 1 and consider the factorization of the 
remaining RVs as in the scheme of i55[ . that is, according to 

PUlaPU2aPX2\U2aPUlpb.U2pb\Ula.U2a.X2PXi\U2a.X2Ml^Mlpb,U2pb- 

With this factorization of the distributions, we obtain the achievable region 









■R'lc 




I{Uic;X2\U2c) 


(56a) 








Rlpb 


> 


I{Uipb;X2\U2c,Uic) 


(56b) 






^'ipb 


+ -^2pb 


> 


liUipb] X2, U2pb\U2c, Uic) 


(56c) 






R2pa 


+ -R2pb 


< 


IiY2;X2,U2pb\U2c,Uic) + 


I{Uic;X2\U2c) (56d) 




-RicH 


- R2pa 


+ -R2pb 


< 


I{Y2;Uic,X2,U2pb\U2c) 


(56e) 


R2c H 


h-RicH 


- R2pa 


+ -R2pb 


< 


I{Y2;U2pb,Uic,U2c,X2) 


(56f) 






Rlpb 


+ -R'lpb 


< 


I{Yi;Uipb\U2c,Uic) 


(56g) 




-Ric - 


1- -Rlpb 


+ -R'lpb 


< 


I{Yi;Uic,Uipb\U2c) 


(56h) 


R2c - 


fi?lc- 


1^ -Rlpb 


+ R'lpb 


< 


I{Yi;U2c,Uic,Uipb) 


(56i) 


icular factorization 


we have that I {Uic, X2\U2c) ~ 0, since 


X2 is conditionally independent of Uic 



given U2c- 

We now compare the region of i55i and ( l56l l for a fixed input distribution, equation by equation: 



(l56bll = §5^ 



56a = §M 
(l56dl) = §5^ 



Sm = (155^ 
dSli = (|55i 



56g 



56h 



55g 



55h l 



We see that ( I55db and 



I]) are extra bounds that further restrict the region in f25l to be contained in the region of Theorem 



